ANTIMICROBIAL POLYPEPTIDES
AND THEIR USES 

FIELD OF THE INVENTION 
The invention relates to plant disease resistance, particularly resistance to
fungal pathogens. More specifically, the present invention relates to the use of
naturally occurring antimicrobial polypeptides isolated from insects induced with
plant pathogens.

BACKGROUND OF THE INVENTION 
Multicellular organisms produce a battery of antimicrobial peptides and
proteins to defend themselves against microbial attack or injury. Many of these
induced peptides and proteins possess broad antimicrobial activity against
Gram-positive and/or Gram-negative bacteria (Boman, H.G. (1995) Annu. Rev. Immunol.
13:61-92). This defense system, called "innate immunity," may represent a chemical
barrier that organisms deploy to stop dangerous microbes at their point of contact.

The peptides and proteins produced in response to microbial attack tend to
work very differently from conventional antibiotics. Antibiotics work to block a
crucial protein in an invading microbe. The mode of action of the antimicrobial
defensive proteins varies. In some instances, they punch holes in a microbe's
membranes and disrupt internal signaling of the microbe. In other instances, they
may act to increase the host cell immune activity.

Several antimicrobial peptides have been isolated and their structures partially
characterized. The defensins, one type of antimicrobial peptide, are cysteine-rich
peptides. Defensins have been isolated from insects and mammals. Insect defensins
are 34-43 amino acid peptides with three disulfide bridges. They are produced by
the insect fat body (Hoffmann et al. (1992) Immunol. Today 13:411-15). They have
been shown to disrupt the permeability of the cytoplasmic membrane of Micrococcus
luteus through the formation of voltage-dependent ion channels in the
cytoplasmic membrane (Cociancich et al. (1993) J. Biol. Chem. 268:19239-19245).
Thionins are another group of small cysteine-rich antimicrobial peptides.
Thionins are thought to play a role in the protection of plants against microbial
infection. They are found in the seed endosperm, stems, roots, and in etiolated or
pathogen-stressed leaves of many plant species (Bohlmann et al. (1991) Annu. Rev.
Plant Physiol. Plant Mol. Biol. 42:227-240). Thionins display toxicity to bacteria,
fungi, yeasts, and even various mammalian cell types.

Disease in plants has many causes including fungi, viruses, bacteria, and
nematodes. Phytopathogenic fungi have resulted in significant annual crop yield
losses as well as devastating epidemics. Additionally, plant disease outbreaks have
resulted in catastrophic crop failures that have triggered famines and caused major
social change.

Molecular methods of crop protection not only have the potential to
implement novel mechanisms for disease resistance, but can also be implemented
more quickly than traditional breeding methods. Accordingly, molecular methods are
needed to supplement traditional breeding methods to protect plants from pathogen
attack.

Plant pathogenic fungi attack all of the approximately 300,000 species of
flowering plants, but a single plant species can be host to only a few fungal species,
and most fungi usually have a limited host range. It is for this reason that the best
general strategy to date for controlling plant fungal disease has been to use resistant
cultivars selected or developed by plant breeders. Unfortunately, even with the use of
resistant cultivars, the potential for serious crop disease epidemics persists today, as
evidenced by outbreaks of Victoria blight of oats and southern corn leaf blight.

Accordingly, molecular methods utilizing the resistance mechanisms of
naturally occurring plant insect pests to enhance plant disease resistance to microbes,
particularly pathogenic fungi, are desirable.

SUMMARY OF THE INVENTION 
Compositions and methods for increasing resistance to pathogens are
provided. The compositions comprise antipathogenic peptides or defensive agents
that are induced in insects by contacting the insect with a pathogen of interest. The
compositions include polypeptides that possess antimicrobial properties, particularly
fungicidal properties, and the nucleic acid molecules that encode such polypeptides.
The methods and compositions of the present invention find use in impacting plant
microbial pathogens and in enhancing plant disease resistance to microbial pathogens.

Expression cassettes comprising the nucleic acid molecules encoding the
defensive agents, vector sequences and host cells for the expression of the
polypeptides, and antibodies to the polypeptides are also provided. The compositions
of the invention further provide plant cells, plants, and seed thereof, transformed with
the nucleic acid molecules of the invention. The transgenic plants of the present
invention are transformed with a nucleotide sequence of the invention and exhibit
increased antimicrobial disease resistance, particularly fungal disease resistance that
will lessen the need for artificial agricultural chemicals to protect field crops and
increase crop yield.

The methods of the invention involve stably transforming a plant with at least
one expression cassette comprising at least one nucleotide sequence of the invention
operably linked with a promoter capable of driving expression of the nucleotide
sequence in the plant or plant cell. It is recognized that a variety of promoters will be
useful in the invention, the choice of which will depend in part upon the desired tissue
localization and the level of expression of the disclosed nucleotide sequences and
corresponding polypeptides. It is recognized that the levels of expression of the
defensive agents in the plant cell can be controlled so as to achieve optimal disease
resistance.

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. Amino acid sequence alignment of precursor Mag1 polypeptide
(SEQ ID NO:2) with the class of immune proteins known as attacins. The precursor
Mag1 polypeptide has 78% sequence similarity with attacin E/F precursor polypeptide
(SEQ ID NO:19, Accession No: P01513). The remaining sequences are: Attacin A
precursor polypeptide (SEQ ID NO:17, Accession No: P50725); Attacin B precursor
polypeptide (SEQ ID NO:18, Accession No: P01512); and the attacin precursor
polypeptide known as Nuecin (SEQ ID NO:20, Accession No: Q26431).

Figure 2. Amino acid sequence alignment of precursor Mag1 polypeptide
(SEQ ID NO:2) with homologous polypeptide sequences of the invention encoded by
cDNAs isolated from pathogen-induced Manduca sexta libraries (SEQ ID NOS:4, 6,
8, and 10).
Figure 3. The N-terminal amino acid sequences for the four Mag1
polypeptide Lys-C digestion fragments (SEQ ID NOS:96, 97, 98, and 99).



DETAILED DESCRIPTION OF THE INVENTION 
The present invention provides compositions and methods for enhancing plant
disease resistance to plant pathogens, particularly fungal pathogens. The compositions
of the invention include polypeptides and peptides that possess antimicrobial activity,
particularly fungicidal activity. Such peptides or polypeptides are collectively
referred to as "defensive agents" herein. Nucleic acid molecules encoding such
defensive agents, as well as plants transformed with the nucleic acid molecules, are
also included.

The invention is drawn to compositions and methods for inducing resistance in
a plant to plant pests. The defensive agents comprise insect-derived nucleotide and
polypeptide sequences. Accordingly, the compositions and methods are also useful in
protecting plants against fungal pathogens, viruses, nematodes, and the like.

Compositions for controlling plant pathogenic agents, particularly plant
pathogenic microbial agents, more particularly plant pathogenic fungal agents, are
provided. Specific compositions provided include insect-derived antimicrobial
polypeptides, and the nucleic acid molecules encoding such polypeptides. Plants,
plant cells, plant tissues and seeds thereof transformed with the nucleotide sequences
of the invention are provided. Additionally, the compositions of the invention can be
used in formulations for their disease resistance activities.

The present invention provides for isolated nucleic acid molecules comprising
nucleotide sequences encoding the amino acid sequences shown in SEQ ID NOS:2, 4,
6, 8, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46,
47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80,
82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, or 127. Further provided are polypeptides having an amino acid
sequence encoded by a nucleic acid molecule described herein, for example, those set
forth in SEQ ID NOS:1, 3, 5, 7, 9, 11, 13, 15, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51,
54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90, 93, 100, 102, 104, 106, 108, 110,
112, 114, 116, 118, 120, 122, 124, or 126, and fragments and variants thereof.

Methods are provided for the expression of these sequences in a host plant to
confer enhanced disease resistance of the host plant to plant pathogens, particularly
plant fungal pathogens. The methods of the invention involve stably transforming a
plant with at least one expression cassette comprising at least one nucleotide sequence
of the invention operably linked with a promoter capable of driving expression of the
nucleotide sequence in the plant cell. It is recognized that a variety of promoters will
be useful in the invention, the choice of which will depend in part upon the desired
level and desired tissue localization of expression of the disclosed nucleotide
sequences. It is recognized that the levels and tissue location of expression can be
controlled to modulate the levels of the antimicrobial polypeptides in the plant cell to
optimize plant disease resistance to a particular pathogen.

By "plant pathogen" or "plant pest" is intended any microorganism that can
cause harm to a plant, such as by inhibiting or slowing the growth of a plant, by
damaging the tissues of a plant, by weakening the immune system of a plant or the
resistance of a plant to abiotic stresses, and/or by causing the premature death of the
plant, etc. Plant pathogens and plant pests include microbes such as fungi, viruses,
bacteria, and nematodes.

By "disease resistance" or "pathogen resistance" is intended that the plants
avoid the disease symptoms which are the outcome of plant-pathogen interactions.
That is, pathogens are prevented from causing plant diseases and the associated
disease symptoms, or alternatively, the disease symptoms caused by the pathogen are
minimized or lessened. The methods of the invention can be utilized to protect plants
from disease, particularly those diseases that are caused by plant fungal pathogens.
An "antimicrobial agent," a "pesticidal agent," a "defensive agent," and/or a
"fungicidal agent" will act similarly to suppress, control, and/or kill the invading
pathogen.

A defensive agent will possess defensive activity. By "defensive activity" is 
intended an antipathogenic, antimicrobial, or antifungal activity. 

By "antipathogenic compositions" is intended that the compositions of the
invention have activity against pathogens, including fungi, microorganisms, viruses,
and nematodes, and thus are capable of suppressing, controlling, and/or killing the
invading pathogenic organism. An antipathogenic composition of the invention will
reduce the disease symptoms resulting from microbial pathogen challenge by at least
about 5% to about 50%, at least about 10% to about 60%, at least about 30% to about
70%, at least about 40% to about 80%, or at least about 50% to about 90% or greater.
Hence, the methods of the invention can be utilized to protect organisms, particularly
plants, from disease, particularly those diseases that are caused by invading
pathogens.

Assays that measure antipathogenic activity are commonly known in the art,
as are methods to quantify disease resistance in plants following pathogen infection.
See, for example, U.S. Patent No. 5,614,395, herein incorporated by reference. Such
techniques include measuring, over time, the average lesion diameter, the pathogen
biomass, and the overall percentage of decayed plant tissues. For example, a plant
either expressing an antipathogenic polypeptide or having an antipathogenic
composition applied to its surface shows a decrease in tissue necrosis (i.e., lesion
diameter) or a decrease in plant death following pathogen challenge when compared
to a control plant that was not exposed to the antipathogenic composition.
Alternatively, antipathogenic activity can be measured by a decrease in pathogen
biomass. For example, a plant expressing an antipathogenic polypeptide or exposed
to an antipathogenic composition is challenged with a pathogen of interest. Over
time, tissue samples from the pathogen-inoculated tissues are obtained and RNA is
extracted. The percent of a specific pathogen RNA transcript relative to the level of a
plant-specific transcript allows the level of pathogen biomass to be determined. See,
for example, Thomma et al. (1998) Plant Biology 95:15107-15111, herein
incorporated by reference.
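
The two readouts described above (a decrease in a symptom measure such as lesion diameter, and pathogen transcript expressed as a percent of a plant reference transcript) reduce to simple ratios. The following is a minimal sketch of that arithmetic only; the function names and the numeric inputs are hypothetical and are not taken from the specification or from the cited references.

```python
def percent_reduction(control: float, treated: float) -> float:
    """Percent decrease of a disease measure (e.g., mean lesion diameter)
    in a treated or transgenic plant relative to an untreated control."""
    if control <= 0:
        raise ValueError("control measurement must be positive")
    return 100.0 * (control - treated) / control


def relative_pathogen_biomass(pathogen_transcript: float,
                              plant_transcript: float) -> float:
    """Pathogen-specific transcript signal expressed as a percent of a
    plant-specific reference transcript signal from the same sample."""
    if plant_transcript <= 0:
        raise ValueError("plant reference signal must be positive")
    return 100.0 * pathogen_transcript / plant_transcript


if __name__ == "__main__":
    # Hypothetical measurements, for illustration only.
    print(percent_reduction(control=10.0, treated=6.0))
    print(relative_pathogen_biomass(pathogen_transcript=5.0,
                                    plant_transcript=200.0))
```

Comparing the relative biomass value of a treated plant against that of a control plant at the same time point would then give the decrease in pathogen biomass.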

Furthermore, in vitro fungicidal assays include, for example, the addition of
varying concentrations of the fungicidal composition to paper disks and placing the
disks on agar containing a suspension of the pathogen of interest. Following
incubation, clear inhibition zones develop around the disks that contain an effective
concentration of the fungicidal polypeptide (Liu et al. (1994) Plant Biology 91:1888-
1892, herein incorporated by reference). Additional methods are used in the art to
measure the in vitro fungicidal properties of a composition (Hu et al. (1997) Plant
Mol. Biol. 34:949-959; Cammue et al. (1992) J. Biol. Chem. 267:2228-2233; and
Thevissen et al. (1996) J. Biol. Chem. 271:15018-15025, all of which are herein
incorporated by reference).
Pathogens of the invention include, but are not limited to, viruses or viroids,
bacteria, insects, nematodes, fungi, and the like. Viruses include any plant virus, for
example, tobacco or cucumber mosaic virus, ringspot virus, necrosis virus, maize
dwarf mosaic virus, etc. Specific fungal and viral pathogens for the major crops
include: Soybeans: Phytophthora megasperma f. sp. glycinea, Macrophomina
phaseolina, Rhizoctonia solani, Sclerotinia sclerotiorum, Fusarium oxysporum,
Diaporthe phaseolorum var. sojae (Phomopsis sojae), Diaporthe phaseolorum var.
caulivora, Sclerotium rolfsii, Cercospora kikuchii, Cercospora sojina, Peronospora
manshurica, Colletotrichum dematium (Colletotrichum truncatum), Corynespora
cassiicola, Septoria glycines, Phyllosticta sojicola, Alternaria alternata,
Pseudomonas syringae p.v. glycinea, Xanthomonas campestris p.v. phaseoli,
Microsphaera diffusa, Fusarium semitectum, Phialophora gregata, Soybean mosaic
virus, Glomerella glycines, Tobacco Ring spot virus, Tobacco Streak virus,
Phakopsora pachyrhizi, Pythium aphanidermatum, Pythium ultimum, Pythium
debaryanum, Tomato spotted wilt virus, Heterodera glycines, Fusarium solani;
Canola: Albugo candida, Alternaria brassicae, Leptosphaeria maculans, Rhizoctonia
solani, Sclerotinia sclerotiorum, Mycosphaerella brassicicola, Pythium ultimum,
Peronospora parasitica, Fusarium roseum, Alternaria alternata; Alfalfa: Clavibacter
michiganensis subsp. insidiosum, Pythium ultimum, Pythium irregulare, Pythium
splendens, Pythium debaryanum, Pythium aphanidermatum, Phytophthora
megasperma, Peronospora trifoliorum, Phoma medicaginis var. medicaginis,
Cercospora medicaginis, Pseudopeziza medicaginis, Leptotrochila medicaginis,
Fusarium, Xanthomonas campestris p.v. alfalfae, Aphanomyces euteiches,
Stemphylium herbarum, Stemphylium alfalfae; Wheat: Pseudomonas syringae p.v.
atrofaciens, Urocystis agropyri, Xanthomonas campestris p.v. translucens,
Pseudomonas syringae p.v. syringae, Alternaria alternata, Cladosporium herbarum,
Fusarium graminearum, Fusarium avenaceum, Fusarium culmorum, Ustilago tritici,
Ascochyta tritici, Cephalosporium gramineum, Colletotrichum graminicola, Erysiphe
graminis f.sp. tritici, Puccinia graminis f.sp. tritici, Puccinia recondita f.sp. tritici,
Puccinia striiformis, Pyrenophora tritici-repentis, Septoria nodorum, Septoria tritici,
Septoria avenae, Pseudocercosporella herpotrichoides, Rhizoctonia solani,
Rhizoctonia cerealis, Gaeumannomyces graminis var. tritici, Pythium
aphanidermatum, Pythium arrhenomanes, Pythium ultimum, Bipolaris sorokiniana,
Barley Yellow Dwarf Virus, Brome Mosaic Virus, Soil Borne Wheat Mosaic Virus,
Wheat Streak Mosaic Virus, Wheat Spindle Streak Virus, American Wheat Striate
Virus, Claviceps purpurea, Tilletia tritici, Tilletia laevis, Ustilago tritici, Tilletia
indica, Rhizoctonia solani, Pythium arrhenomanes, Pythium graminicola, Pythium
aphanidermatum, High Plains Virus, European wheat striate virus; Sunflower:
Plasmopara halstedii, Sclerotinia sclerotiorum, Aster Yellows, Septoria helianthi,
Phomopsis helianthi, Alternaria helianthi, Alternaria zinniae, Botrytis cinerea,
Phoma macdonaldii, Macrophomina phaseolina, Erysiphe cichoracearum, Rhizopus
oryzae, Rhizopus arrhizus, Rhizopus stolonifer, Puccinia helianthi, Verticillium
dahliae, Erwinia carotovorum p.v. carotovora, Cephalosporium acremonium,
Phytophthora cryptogea, Albugo tragopogonis; Corn: Fusarium moniliforme var.
subglutinans, Erwinia stewartii, Fusarium verticilloides, Fusarium moniliforme,
Gibberella zeae (Fusarium graminearum), Stenocarpella maydis (Diplodia maydis),
Pythium irregulare, Pythium debaryanum, Pythium graminicola, Pythium splendens,
Pythium ultimum, Pythium aphanidermatum, Aspergillus flavus, Bipolaris maydis O,
T (Cochliobolus heterostrophus), Helminthosporium carbonum I, II & III
(Cochliobolus carbonum), Exserohilum turcicum I, II & III, Helminthosporium
pedicellatum, Physoderma maydis, Phyllosticta maydis, Kabatiella maydis,
Cercospora sorghi, Ustilago maydis, Puccinia sorghi, Puccinia polysora,
Macrophomina phaseolina, Penicillium oxalicum, Nigrospora oryzae, Cladosporium
herbarum, Curvularia lunata, Curvularia inaequalis, Curvularia pallescens,
Clavibacter michiganense subsp. nebraskense, Trichoderma viride, Maize Dwarf
Mosaic Virus A & B, Wheat Streak Mosaic Virus, Maize Chlorotic Dwarf Virus,
Claviceps sorghi, Pseudomonas avenae, Erwinia chrysanthemi pv. zea, Erwinia
carotovora, Corn stunt spiroplasma, Diplodia macrospora, Sclerophthora
macrospora, Peronosclerospora sorghi, Peronosclerospora philippinensis,
Peronosclerospora maydis, Peronosclerospora sacchari, Sphacelotheca reiliana,
Physopella zeae, Cephalosporium maydis, Cephalosporium acremonium, Maize
Chlorotic Mottle Virus, High Plains Virus, Maize Mosaic Virus, Maize Rayado Fino
Virus, Maize Streak Virus, Maize Stripe Virus, Maize Rough Dwarf Virus; Sorghum:
Exserohilum turcicum, Colletotrichum graminicola (Glomerella graminicola),
Cercospora sorghi, Gloeocercospora sorghi, Ascochyta sorghina, Pseudomonas
syringae p.v. syringae, Xanthomonas campestris p.v. holcicola, Pseudomonas
andropogonis, Puccinia purpurea, Macrophomina phaseolina, Periconia circinata,
Fusarium moniliforme, Alternaria alternata, Bipolaris sorghicola, Helminthosporium
sorghicola, Curvularia lunata, Phoma insidiosa, Pseudomonas avenae (Pseudomonas
alboprecipitans), Ramulispora sorghi, Ramulispora sorghicola, Phyllachara
sacchari, Sporisorium reilianum (Sphacelotheca reiliana), Sphacelotheca cruenta,
Sporisorium sorghi, Sugarcane mosaic H, Maize Dwarf Mosaic Virus A & B,
Claviceps sorghi, Rhizoctonia solani, Acremonium strictum, Sclerophthora
macrospora, Peronosclerospora sorghi, Peronosclerospora philippinensis,
Sclerospora graminicola, Fusarium graminearum, Fusarium oxysporum, Pythium
arrhenomanes, Pythium graminicola; Rice: Magnaporthe grisea, Rhizoctonia solani,
etc.

The specific defensive agents of the invention have been demonstrated to have
antipathogenic activity against particular pathogens. It is recognized that they may
demonstrate activity against other pathogens, particularly other fungal pathogens.
Some may even exhibit broad-spectrum antipathogenic activity. It is recognized that
while antifungal polypeptides may demonstrate activity against a particular pest, such
defensive agents may have activity against numerous fungal pathogens, as well as
other plant pests. Thus, a plant transformed with a particular defensive agent of the
invention may demonstrate broad-spectrum resistance.

In one embodiment of the invention, defensive agents are isolated from the
hemolymph of insect larvae induced by injection of a plant pathogenic fungus. The
antimicrobial polypeptides induced can be placed into at least four groups according
to their amino acid sequence homology to known classes of proteins. These four
groups consist of the attacin, lebocin, and serine protease inhibitor classes of proteins,
and a group that does not demonstrate substantial homology to known proteins. The
defensive agents enhance disease resistance to the fungal pathogens Magnaporthe grisea
(M. grisea), Rhizoctonia solani (R. solani), and Fusarium verticilloides (F.
verticilloides). Specifically, the polypeptides of the invention were identified from
the hemolymph of insect larvae induced by injection of the plant pathogenic fungi M.
grisea, R. solani, or F. verticilloides.

The compositions of the invention comprise M. sexta (tobacco hornworm),
Heliothis virescens (tobacco budworm), Ostrinia nubilalis (European corn borer),
Peregrinus maidis (corn planthopper), Helicoverpa zea (corn earworm), and Agrotis
ipsilon (black cutworm) nucleic acid and amino acid sequences. Particularly, an M.
sexta full-length cDNA, herein designated Mag1 (SEQ ID NO:1), and corresponding
amino acid sequence (SEQ ID NO:2); an M. sexta full-length cDNA, herein
designated Rhizoc2 or iimlc.pk003.f3 (SEQ ID NO:3), and corresponding amino
acid sequence (SEQ ID NO:4); an M. sexta partial cDNA, herein designated
iiglc.pk004.f3 (SEQ ID NO:5), and corresponding amino acid sequence (SEQ ID
NO:6); an M. sexta partial cDNA, herein designated imilc.pk001.h7 (SEQ ID NO:7),
and corresponding amino acid sequence (SEQ ID NO:8); an M. sexta partial cDNA,
herein designated imilc.pk002.m21 (SEQ ID NO:9), and corresponding amino acid
sequence (SEQ ID NO:10); an M. sexta full-length cDNA, herein designated Rhizoc1
(SEQ ID NO:11), and corresponding amino acid sequence (SEQ ID NO:12); an M.
sexta full-length cDNA, herein designated Fus1 (SEQ ID NO:13), and corresponding
amino acid sequence (SEQ ID NO:14); and an M. sexta full-length cDNA, herein
designated Rhizoc3 (SEQ ID NO:15), and corresponding amino acid sequence (SEQ
ID NO:16).

The mature Mag1 polypeptide was isolated from the hemolymph of M. sexta
larvae induced by injection of the plant pathogenic fungus M. grisea. The Mag1
precursor polypeptide consists of 206 amino acids. This polypeptide belongs to a
broad class of insect immune proteins known as attacins that were originally isolated
from Hyalophora cecropia. A Mag1 precursor polypeptide-encoding cDNA (SEQ ID
NO:1) was subsequently isolated from a cDNA library derived from the fat bodies of
pathogen-induced M. sexta. The Mag1 precursor polypeptide shares 78% sequence
similarity with attacin E/F precursor (SEQ ID NO:19, Figure 1).

Attacin proteins are induced upon injection of insects (mostly lepidopteran
species) with bacteria, and have been demonstrated to possess antibacterial properties
(Kockum et al. (1984) EMBO J. 3:2071-2075; Engstrom et al. (1984) EMBO J.
3:2065-2070; Engstrom et al. (1984) EMBO J. 3:3347-3351; Bowman et al. (1985)
Dev. Comp. Immunol. 9:551-558; Sun et al. (1991) Eur. J. Biochem. 196:247-254;
and Ko, K. (2000) http://www.scisoc.org/feature/BioTechnology/antimicrobial.html).
The Mag1 polypeptide was induced by injection of an insect with a plant pathogenic
fungus, rather than by induction with a bacterium. Furthermore, the isolated Mag1
polypeptide demonstrates fungicidal activity at low concentrations against the plant
pathogen M. grisea (see Example 1).
In addition, the polypeptides set forth in SEQ ID NOS:6, 8, and 10, and
encoded by the cDNA clones iiglc.pk004.f3, imilc.pk001.h7, and imilc.pk002.m21,
respectively, are also attacin homologs. These polypeptides display about 48 to
62.3% sequence identity to the Mag1 polypeptide (SEQ ID NO:2) (see Figure 2).
These cDNA clones were isolated from M. grisea (iiglc.pk004.f3) and B. bassiana
(imilc.pk001.h7 and imilc.pk002.m21) induced M. sexta derived cDNA libraries.

Similar to the Mag1 precursor polypeptide, the Rhizoc2 (SEQ ID NO:3)
precursor polypeptide also shares sequence homology to the attacin class of proteins.
The Rhizoc2 precursor polypeptide shares 75% sequence similarity and 68%
sequence identity with the attacin E/F precursor protein shown in Figure 1 (SEQ ID
NO:19). The cDNA encoding the Rhizoc2 precursor polypeptide (SEQ ID NO:3) was
isolated from a cDNA library derived from the fat bodies of R. solani induced M.
sexta. The Rhizoc2 precursor polypeptide consists of 196 amino acids and the mature
polypeptide demonstrates fungicidal activity at low concentrations against the plant
pathogen R. solani (see Example 1). The partial cDNA imilc.pk001.h7 identified
from a B. bassiana induced M. sexta library is a fragment of the Rhizoc2 sequence.
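
Percent identity and percent similarity figures such as those quoted above are derived from pairwise alignments. The following is a minimal sketch of how the two values can be counted from an already aligned pair of sequences. It is an illustration only: the specification does not state which alignment program, scoring matrix, or gap convention was used, the residue groupings in SIMILAR_GROUPS are a hypothetical simplification, and the example sequences are invented.

```python
# Hypothetical conservative-substitution groups; a real analysis would
# normally rely on a scoring matrix chosen by the alignment program.
SIMILAR_GROUPS = ("ILVM", "FWY", "KRH", "DE", "NQ", "ST", "AG")


def identity_and_similarity(aligned_a: str, aligned_b: str) -> tuple[float, float]:
    """Return (percent identity, percent similarity) for two aligned
    sequences of equal length, ignoring columns that contain a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    identical = similar = compared = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # gap columns are excluded in this sketch
        compared += 1
        if a == b:
            identical += 1
            similar += 1  # identical residues also count as similar
        elif any(a in group and b in group for group in SIMILAR_GROUPS):
            similar += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared, 100.0 * similar / compared


if __name__ == "__main__":
    # Invented toy alignment, not a sequence from the specification.
    print(identity_and_similarity("MKLV-A", "MRIVGA"))
```

Because similarity counts identical residues plus conservative substitutions, it is always at least as large as identity, which matches the pattern of the values reported for Rhizoc2 (75% similarity, 68% identity).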

Another polypeptide, designated Rhizoc1, with homology to the lebocin class
of insect immune proteins, was similarly isolated from the hemolymph of M. sexta
larvae induced by injection of the plant pathogenic fungus R. solani. A Rhizoc1
precursor polypeptide-encoding cDNA (SEQ ID NO:11) was subsequently isolated
from a cDNA library derived from the fat bodies of M. grisea induced M. sexta. The
Rhizoc1 precursor polypeptide consists of 142 amino acids and shares 65% sequence
similarity and 61% sequence identity with lebocin 4 precursor protein (Accession No:
JC5666).

The Rhizoc1 polypeptide demonstrates fungicidal activity at low
concentrations against the plant pathogens R. solani and F. verticilloides (see
Example 1). Unlike other members of the lebocin class of polypeptides, the Rhizoc1
polypeptide was induced upon injection of an insect with a plant fungal pathogen,
rather than by induction with a bacterium. Indeed, other lebocin polypeptides have been
demonstrated to possess antibacterial rather than fungicidal properties (Hara and
Yamakawa (1995) Biochem. J. 310:651-656; Chowdhury, S. et al. (1995) Biochem.
Biophys. Res. Com. 214:271-278; and Furukawa, S. et al. (1997) Biochem. Biophys.
Res. Com. 238:769-774).
Additional Rhizoc1 homologs have been identified. The nucleotide sequences
of the Rhizoc1 homologs are set forth in SEQ ID NOS:27, 33, 45, 48, 51, 72, 81, and
84. The amino acid sequences of the Rhizoc1 homologs are set forth in SEQ ID
NOS:28, 29, 34, 35, 46, 47, 49, 50, 52, 53, 73, 74, 82, 83, 85, and 86.

A mature polypeptide, designated Fus1, was isolated from the hemolymph of
M. sexta larvae induced by injection of the plant pathogenic fungus F. verticilloides.
This polypeptide demonstrates fungicidal activity at low concentrations against the
plant pathogen F. verticilloides (see Example 1). A cDNA encoding the mature Fus1
polypeptide and part of the signal sequence (SEQ ID NO:13) was subsequently
isolated from a cDNA library derived from the fat bodies of M. grisea induced M.
sexta.

The Fus1 polypeptide of the invention is homologous to several proteins
isolated from insect species that belong to the class of proteins known as the serine
protease inhibitors (Frobius et al. (2000) Eur. J. Biochem. 267:2046-2053; Ramesh et
al. (1988) J. Biol. Chem. 263:11523-11527; and Sasaki, T. (1988) Biol. Chem.
369:1235-1241). The Fus1 polypeptide has about 47% sequence similarity to these
proteins. The polypeptides identified by Frobius et al. were isolated from Galleria
mellonella hemolymph after injection of larvae with a yeast polysaccharide
preparation, and demonstrate inhibition of serine proteases from the
entomopathogenic fungus Metarhizium anisopliae, an insect pathogen. A
codon-biased Fus1 nucleotide sequence linked to the BAA signal sequence has been created.
The codon-biased Fus1 nucleotide sequence was developed according to the codon
bias of M. sexta. The codon-biased BAA-Fus1 nucleotide sequence is set forth in SEQ
ID NO:120 and the codon-biased Fus1 sequence is set forth in SEQ ID NO:122. The
amino acid sequence of the BAA-Fus1 polypeptide is set forth in SEQ ID NO:121 and
SEQ ID NO:123.
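
As an illustration of what a codon-biased sequence is, the sketch below back-translates a peptide by choosing one preferred codon per amino acid and then joins a signal-sequence coding region in frame ahead of it. This is a simplified sketch under stated assumptions, not the procedure used to produce SEQ ID NOS:120 and 122: the PREFERRED_CODON table is a hypothetical placeholder rather than a measured codon-usage table for any organism, and the example peptide and signal codons are invented.

```python
# Hypothetical "preferred codon" table (one valid standard-code codon per
# amino acid). A real design would use measured codon-usage frequencies.
PREFERRED_CODON = {
    "A": "GCC", "R": "CGC", "N": "AAC", "D": "GAC", "C": "TGC",
    "Q": "CAG", "E": "GAG", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAG", "M": "ATG", "F": "TTC", "P": "CCC",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}


def back_translate(peptide: str) -> str:
    """Encode a peptide using the preferred codon for every residue."""
    try:
        return "".join(PREFERRED_CODON[aa] for aa in peptide.upper())
    except KeyError as err:
        raise ValueError(f"unsupported residue: {err.args[0]}") from None


def fuse_signal(signal_nt: str, mature_nt: str) -> str:
    """Join a signal-sequence coding region and a mature-peptide coding
    region, checking that both are whole codons so the frame is kept."""
    if len(signal_nt) % 3 or len(mature_nt) % 3:
        raise ValueError("both coding regions must be a multiple of 3 nt")
    return signal_nt + mature_nt


if __name__ == "__main__":
    mature = back_translate("MKV")          # invented toy peptide
    print(fuse_signal("ATGGCC", mature))    # invented toy signal codons
```

Choosing a single codon per residue is the simplest possible scheme; practical designs often sample codons in proportion to their usage and also avoid unwanted motifs.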

Additional Fus1 homologs have been identified. The nucleotide sequences of
the Fus1 homologs are set forth in SEQ ID NOS:21, 36, and 78. The amino acid
sequences of the Fus1 homologs are set forth in SEQ ID NOS:22, 23, 37, 38, 79, and
80.

A mature polypeptide, designated Rhizoc3, was isolated from the hemolymph
of M. sexta larvae induced by injection of the plant pathogenic fungus R. solani. This
polypeptide demonstrates fungicidal activity at low concentrations against the plant
pathogen R. solani (see Example 1).

A Rhizoc3 precursor polypeptide-encoding cDNA (SEQ ID NO:15) was
subsequently isolated from a cDNA library derived from the fat bodies of M. grisea
induced M. sexta. The Rhizoc3 precursor polypeptide consists of 61 amino acids and
does not demonstrate sequence homology to any known proteins.

Homologs of Fus4 have been identified. The nucleotide sequences of the Fus4
homologs are set forth in SEQ ID NOS:24, 30, 39, 42, 54, 57, 60, 63, 66, 69, 75, 87,
90, and 93. The amino acid sequences of the Fus4 homologs are set forth in SEQ ID
NOS:25, 26, 31, 32, 40, 41, 43, 44, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 76,
77, 88, 89, 91, 92, 94, and 95.

Additional polypeptides active against Fusarium species have been identified
from Agrotis ipsilon. The Fus6, Fus7, Fus8, Fus9, and Fus10 nucleotide sequences
are set forth in SEQ ID NOS:100, 102, 104, 106, 108, 110, 112, 114, 116, and 118.
The amino acid sequences of the Fus6, Fus7, Fus8, Fus9, and Fus10 polypeptides are
set forth in SEQ ID NOS:101, 103, 105, 107, 109, 111, 113, 115, 117, and 119.

A codon-biased Fus2 nucleotide sequence linked to the BAA signal sequence
has been created. The codon-biased BAA-Fus2 nucleotide sequence is set forth in
SEQ ID NO:124 and the codon-biased Fus2 sequence is set forth in SEQ ID NO:126.
The amino acid sequence of the BAA-Fus2 polypeptide is set forth in SEQ ID
NO:125 and SEQ ID NO:127.

The polypeptides encoded by the nucleotide sequences of the invention may
be processed into mature peptides as discussed elsewhere herein. The region from
nucleotide 169 to nucleotide 298 of SEQ ID NO:11 encodes the mature Rhizoc1
peptide. The region from nucleotide 58 to nucleotide 624 of SEQ ID NO:3 encodes
the mature Rhizoc2 peptide. The region from nucleotide 86 to nucleotide 208 of SEQ
ID NO:15 encodes the mature Rhizoc3 peptide. The region from nucleotide 46 to
nucleotide 216 of SEQ ID NO:13 encodes the mature Fus1 peptide. The nucleotide
sequence set forth in SEQ ID NO:102 encodes the mature Fus6 peptide, the amino
acid sequence of which is set forth in SEQ ID NO:103. The nucleotide sequence set
forth in SEQ ID NO:106 encodes the mature Fus7 peptide, the amino acid sequence
of which is set forth in SEQ ID NO:107. The nucleotide sequence set forth in SEQ ID
NO:110 encodes the mature Fus8 peptide, the amino acid sequence of which is set
forth in SEQ ID NO:111. The nucleotide sequence set forth in SEQ ID NO:114
encodes the mature Fus9 peptide, the amino acid sequence of which is set forth in
SEQ ID NO:115. The nucleotide sequence set forth in SEQ ID NO:118 encodes the
mature Fus10 peptide, the amino acid sequence of which is set forth in SEQ ID
NO:119.

Fragments and variants of the disclosed nucleotide sequences and
polypeptides encoded thereby are also encompassed by the present invention. By
"fragment" is intended a portion of the nucleotide sequence or a portion of the amino
acid sequence. Fragments of a nucleotide sequence may encode polypeptide
fragments that retain the biological activity of the native protein and hence possess
antimicrobial and/or fungicidal activity. By "antimicrobial activity" or "fungicidal
activity" is intended the ability to suppress, control, and/or kill the invading
pathogenic microbe or fungus, respectively. A composition of the invention that
possesses antimicrobial or fungicidal activity will reduce the disease symptoms
resulting from microbial or fungal pathogen challenge by at least about 5% to about
50%, at least about 10% to about 60%, at least about 30% to about 70%, at least about
40% to about 80%, or at least about 50% to about 90% or greater. Alternatively,
fragments of a nucleotide sequence that are useful as hybridization probes generally
do not encode fragment proteins retaining biological activity. Thus, fragments of a
nucleotide sequence may range from at least about 20 nucleotides, about 50
nucleotides, about 100 nucleotides, and up to the full-length nucleotide sequence
encoding the proteins of the invention.

Alternatively, fragments of a nucleotide sequence of the invention may encode
polypeptide fragments that are antigenic; thus, they are capable of eliciting an immune
response. An "antigenic polypeptide" is herein defined as a polypeptide that is
capable of generating an antibody. Antigenic polypeptide fragments of the disclosed
amino acid sequences are also encompassed by the invention.

A nucleotide fragment of SEQ ID NO:1 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:2 (Mag1) will encode at
least 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 150, or 200 contiguous amino acids,
or up to the total number of amino acids (206) present in SEQ ID NO:2.

A nucleotide fragment of SEQ ID NO:3 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:4 (Rhizoc2) will encode
at least 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150,
160, 170, 180, or 190 contiguous amino acids, or up to the total number of amino
acids (196) present in SEQ ID NO:4.

A nucleotide fragment of SEQ ID NO:5 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:6 (iiglc.pk004.f3) will
encode at least 25, 30, 35, 40, 45, 50, 55, 60, or 70 contiguous amino acids, or up to
the total number of amino acids (80) present in SEQ ID NO:6.

A nucleotide fragment of SEQ ID NO:7 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:8 (imilc.pk001.h7) will
encode at least 25, 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, or 110 contiguous amino
acids, or up to the total number of amino acids (111) present in SEQ ID NO:8.

A nucleotide fragment of SEQ ID NO:9 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:10 (imilc.pk002.m21)
will encode at least 25, 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 110, 120, 130, or
140 contiguous amino acids, or up to the total number of amino acids (148) present in
SEQ ID NO:10.

A nucleotide fragment of SEQ ID NO:11 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:12 (Rhizoc1) will
encode at least 15, 20, 25, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 110, 120, 130, or
140 contiguous amino acids, or up to the total number of amino acids (142) present in
SEQ ID NO:12.

A nucleotide fragment of SEQ ID NO:13 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:14 (Fus1) will encode at
least 25, 30, 35, 40, 45, 50, 55, 60, 65, or 70 contiguous amino acids, or up to the total
number of amino acids (71) present in SEQ ID NO:14.

A nucleotide fragment of SEQ ID NO:15 that encodes a biologically active or
antigenic portion of the amino acid sequence of SEQ ID NO:16 (Rhizoc3) will
encode at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, or 60 contiguous amino acids,
or up to the total number of amino acids (61) present in SEQ ID NO:16.

A nucleotide fragment of SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 21, 24, 27, 30,
33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90, 93, 100,
102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, or 126 that encodes a
biologically active or antigenic portion of the amino acid sequence of SEQ ID NO:2,
4, 6, 8, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46,
47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80,
82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, or 127 will encode at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, or
55 contiguous amino acids, or up to the total number of amino acids present in SEQ
ID NO:2, 4, 6, 8, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41,
43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76,
77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113,
115, 117, 119, 121, 123, 125, or 127.
A biologically active or antigenic portion of a polypeptide sequence set forth
in SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38,
40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73,
74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111,
113, 115, 117, 119, 121, 123, 125, or 127 can be prepared by isolating a portion of
one of the nucleotide sequences set forth in SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 21,
24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90,
93, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, or 126,
expressing the encoded portion of the polypeptide (e.g., by recombinant expression in
vitro), and assessing the activity of the encoded portion of the polypeptide.

Alternatively, fragments of a nucleotide sequence that are useful as
hybridization probes generally do not encode fragment polypeptides retaining
biological activity. Thus, fragments of a nucleotide sequence may range from at least
about 15 nucleotides, about 30 nucleotides, about 50 nucleotides, about 100
nucleotides, and up to the full-length nucleotide sequence encoding the polypeptides
of the invention.
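By way of illustration only, the range of contiguous fragments described above can be enumerated computationally. The following minimal sketch assumes a short, hypothetical toy sequence (it is not any of the disclosed SEQ ID NOs), and the function names are chosen here for illustration.

```python
from typing import Iterator

# Hypothetical 21-nucleotide toy sequence, used only for illustration.
TOY_SEQUENCE = "ATGGCTAAGTTCCTGGGTACC"


def contiguous_fragments(seq: str, min_len: int) -> Iterator[str]:
    """Yield every contiguous fragment of seq that is at least min_len long."""
    n = len(seq)
    for length in range(min_len, n + 1):
        for start in range(0, n - length + 1):
            yield seq[start:start + length]


def fragment_count(seq_len: int, min_len: int) -> int:
    """Closed-form count of contiguous fragments of length >= min_len."""
    if min_len > seq_len:
        return 0
    k = seq_len - min_len + 1  # number of fragments of the minimum length
    return k * (k + 1) // 2


if __name__ == "__main__":
    fragments = list(contiguous_fragments(TOY_SEQUENCE, 15))
    print(len(fragments), fragment_count(len(TOY_SEQUENCE), 15))
```

The shortest fragments are produced first and the full-length sequence last, mirroring the "at least about 15 nucleotides ... up to the full-length nucleotide sequence" wording.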

Fragments of the nucleotide sequence set forth in SEQ ID NO:1, from
nucleotide 4 to 621, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90, 100,
125, 150, 200, 300, 400, or 500 contiguous nucleotides, or up to the total number of
nucleotides (618) present in SEQ ID NO:1 that encode SEQ ID NO:2 (Mag1).

Fragments of the nucleotide sequence set forth in SEQ ID NO:3, from
nucleotide 34 to 624, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90, 100,
125, 150, 200, 300, 400, or 500 contiguous nucleotides, or up to the total number of
nucleotides (588) present in SEQ ID NO:3 that encode SEQ ID NO:4 (Rhizoc2).

Fragments of the nucleotide sequence set forth in SEQ ID NO:5
(iiglc.pk004.f3), from nucleotide 4 to 249, may range from at least 15, 20, 30, 40, 50,
60, 70, 80, 90, 100, 125, 150, or 200 contiguous nucleotides, or up to the total number
of nucleotides (240) present in SEQ ID NO:5 that encode SEQ ID NO:6.

Fragments of the nucleotide sequence set forth in SEQ ID NO:7
(imilc.pk001.h7), from nucleotide 4 to 336, may range from at least 15, 20, 30, 40,
50, 60, 70, 80, 90, 100, 125, 150, 200, 250, or 300 contiguous nucleotides, or up to the
total number of nucleotides (333) present in SEQ ID NO:7 that encode SEQ ID NO:8.
SEQ ID NO:7 is a fragment of the nucleotide sequence set forth in SEQ ID NO:3.

Fragments of the nucleotide sequence set forth in SEQ ID NO:9
(imilc.pk002.m21), from nucleotide 4 to 447, may range from at least 15, 20, 30, 40,
50, 60, 70, 80, 90, 100, 125, 150, 200, 250, 300, 350, or 400 contiguous nucleotides,
or up to the total number of nucleotides (444) present in SEQ ID NO:9 that encode
SEQ ID NO:10.

Fragments of the nucleotide sequence set forth in SEQ ID NO:11 (Rhizoc1),
from nucleotide 28 to 456, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, 150, 200, 250, 300, 350, or 400 contiguous nucleotides, or up to the total
number of nucleotides (426) present in SEQ ID NO:11 that encode SEQ ID NO:12.

Fragments of the nucleotide sequence set forth in SEQ ID NO:13 (Fus1), from
nucleotide 22 to 237, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90, 100,
125, 150, 175, or 200 contiguous nucleotides, or up to the total number of nucleotides
(216) present in SEQ ID NO:13 that encode SEQ ID NO:14.

Fragments of the nucleotide sequence set forth in SEQ ID NO:15 (Rhizoc3),
from nucleotide 23 to 208, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, or 150 contiguous nucleotides, or up to the total number of nucleotides
(183) present in SEQ ID NO:15 that encode SEQ ID NO:16.

Fragments of the nucleotide sequence set forth in SEQ ID NO:21, 24, 27, 30,
33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90, or 93 may
range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, or 150 contiguous
nucleotides, or up to the total number of nucleotides present in SEQ ID NO:21, 24,
27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90, or
93 that encode SEQ ID NO:22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43,
44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77,
79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, or 95, respectively.

Fragments of the nucleotide sequence set forth in SEQ ID NO:100 (Fus6),
from nucleotide 1 to 195, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, 150, 175, 180, 185, 190, or 195 contiguous nucleotides, or up to the total
number of nucleotides (358) present in SEQ ID NO:100 that encode SEQ ID NO:101.

Fragments of the nucleotide sequence set forth in SEQ ID NO:104 (Fus7),
from nucleotide 1 to 195, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, 150, 175, 180, 185, 190, or 195 contiguous nucleotides, or up to the total
number of nucleotides (387) present in SEQ ID NO:104 that encode SEQ ID NO:105.

Fragments of the nucleotide sequence set forth in SEQ ID NO:108 (Fus8),
from nucleotide 1 to 195, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, 150, 175, 180, 185, 190, or 195 contiguous nucleotides, or up to the total
number of nucleotides (361) present in SEQ ID NO:108 that encode SEQ ID NO:109.

Fragments of the nucleotide sequence set forth in SEQ ID NO:112 (Fus9),
from nucleotide 1 to 195, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, 150, 175, 180, 185, 190, 195, 200, 210, 220, 230, 240, 250, 260, 270, 280,
290, or 291 contiguous nucleotides, or up to the total number of nucleotides (466)
present in SEQ ID NO:112 that encode SEQ ID NO:113.

Fragments of the nucleotide sequence set forth in SEQ ID NO:116 (Fus10),
from nucleotide 1 to 195, may range from at least 15, 20, 30, 40, 50, 60, 70, 80, 90,
100, 125, 150, 175, 180, 185, 190, 195, 200, 210, or 220 contiguous nucleotides, or
up to the total number of nucleotides (372) present in SEQ ID NO:116 that encode
SEQ ID NO:117.

The invention encompasses isolated or substantially purified nucleic acid or
protein compositions. An "isolated" or "purified" nucleic acid molecule or protein, or
biologically active portion thereof, is substantially free of other cellular material, or
culture medium when produced by recombinant techniques, or substantially free of
chemical precursors or other chemicals when chemically synthesized. Preferably, an
"isolated" nucleic acid is free of sequences (preferably protein encoding sequences)
that naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the
nucleic acid) in the genomic DNA of the organism from which the nucleic acid is
derived. For example, in various embodiments, the isolated nucleic acid molecule can
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide
sequences that naturally flank the nucleic acid molecule in genomic DNA of the cell
from which the nucleic acid is derived.

A protein that is substantially free of cellular material includes preparations of
protein having less than about 30%, 20%, 10%, or 5% (by dry weight) of contaminating
protein. When the protein of the invention or biologically active portion thereof is
recombinantly produced, preferably culture medium represents less than about 30%,
20%, 10%, or 5% (by dry weight) of chemical precursors or non-protein-of-interest
chemicals.

By "variants" is intended substantially similar sequences. For nucleotide
sequences, conservative variants include those sequences that, because of the
degeneracy of the genetic code, encode the amino acid sequence of one of the
polypeptides of the invention. Naturally occurring allelic variants such as these can
be identified with the use of well-known molecular biology techniques, as, for
example, with polymerase chain reaction (PCR) and hybridization techniques as
outlined below. Variant nucleotide sequences also include synthetically derived
nucleotide sequences, such as those generated, for example, by using site-directed
mutagenesis but that still encode a polypeptide of the invention. Generally, variants of
a particular nucleotide sequence of the invention will have at least 40%, 50%, 60%,
65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%,
99%, or more sequence identity to that particular nucleotide sequence as determined
by sequence alignment programs described elsewhere herein using default parameters.
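By way of illustration only, the notion of a conservative nucleotide variant arising from the degeneracy of the genetic code can be sketched as follows. The sketch assumes the standard genetic code; the two nine-nucleotide sequences and the function names are hypothetical and are not taken from the disclosed sequences.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def is_conservative_variant(native_cds: str, variant_cds: str) -> bool:
    """True when the nucleotide sequences differ but encode the same amino acids."""
    different = native_cds.upper() != variant_cds.upper()
    return different and translate(native_cds) == translate(variant_cds)


# Hypothetical toy sequences: two synonymous codon changes (GCT->GCC, AAA->AAG).
NATIVE = "ATGGCTAAA"
VARIANT = "ATGGCCAAG"

if __name__ == "__main__":
    print(translate(NATIVE), translate(VARIANT), is_conservative_variant(NATIVE, VARIANT))
```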

By "variant" polypeptide is intended a polypeptide derived from the native
polypeptide by deletion (so-called truncation) or addition of one or more amino acids
to the N-terminal and/or C-terminal end of the native polypeptide; deletion or addition
of one or more amino acids at one or more sites in the native polypeptide; or
substitution of one or more amino acids at one or more sites in the native polypeptide.
Variant polypeptides encompassed by the present invention are biologically active,
that is, they continue to possess the desired biological activity of the native
polypeptide, hence they will continue to possess antimicrobial and/or fungicidal
activity. Such variants may result from, for example, genetic polymorphism or from
human manipulation.

Biologically active variants of a native polypeptide of the invention will have
at least 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%,
96%, 97%, 98%, 99% or more sequence identity to the amino acid sequence for the
native polypeptide as determined by sequence alignment programs described
elsewhere herein using default parameters. A biologically active variant of a
polypeptide of the invention may differ from that polypeptide by as few as 1-15
amino acid residues, as few as 1-10, such as 6-10, as few as 5, as few as 4, 3, 2, or
even 1 amino acid residue.
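By way of illustration only, the number of differing residues and the percent identity between a native polypeptide and a variant can be computed once the two sequences have been aligned. The sketch below assumes pre-aligned sequences of equal length, with "-" marking a gap, and counts identity over the full alignment length (one of several conventions in use). The 20-residue peptides are hypothetical and are not disclosed sequences.

```python
def residue_differences(aligned_a: str, aligned_b: str) -> int:
    """Count aligned positions at which the two sequences differ (gaps included)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for x, y in zip(aligned_a, aligned_b) if x != y)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Identical, non-gap positions as a percentage of the alignment length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


# Hypothetical toy peptides: the variant carries K->R and R->K substitutions.
NATIVE = "ACDEFGHIKLMNPQRSTVWY"
VARIANT = "ACDEFGHIRLMNPQKSTVWY"

if __name__ == "__main__":
    print(residue_differences(NATIVE, VARIANT), percent_identity(NATIVE, VARIANT))
```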

Biological activity of the polypeptides of the present invention can be assayed
by any method known in the art (see for example, U.S. Patent No. 5,614,395;
Thomma et al. (1998) Plant Biology 95:15107-15111; Liu et al. (1994) Plant Biology
91:1888-1892; Hu et al. (1997) Plant Mol. Biol. 34:949-959; Cammue et al. (1992) J.
Biol. Chem. 267:2228-2233; and Thevissen et al. (1996) J. Biol. Chem. 271:15018-15025,
all of which are herein incorporated by reference).

The polypeptides of the invention may be altered in various ways including
amino acid substitutions, deletions, truncations, and insertions. Methods for such
manipulations are generally known in the art. For example, amino acid sequence
variants of the polypeptides of the invention can be prepared by mutations in the
DNA. Methods for mutagenesis and nucleotide sequence alterations are well known
in the art. See, for example, Kunkel (1985) Proc. Natl. Acad. Sci. USA 82:488-492;
Kunkel et al. (1987) Methods in Enzymol. 154:367-382; US Patent No. 4,873,192;
Walker and Gaastra, eds. (1983) Techniques in Molecular Biology (MacMillan
Publishing Company, New York) and the references cited therein. Guidance as to
appropriate amino acid substitutions that do not affect biological activity of the
polypeptide of interest may be found in the model of Dayhoff et al. (1978) Atlas of
Protein Sequence and Structure (Natl. Biomed. Res. Found., Washington, D.C.),
herein incorporated by reference. Conservative substitutions, such as exchanging one
amino acid with another having similar properties, may be preferred.

Thus, the genes and nucleotide sequences of the invention include both the
naturally occurring sequences as well as mutant forms. Likewise, the polypeptides of
the invention encompass both naturally occurring polypeptides as well as variations
and modified forms thereof. Such variants will continue to possess the desired
antimicrobial, or in some cases, fungicidal activity. Obviously, the mutations that will
be made in the DNA encoding the variant must not place the sequence out of reading
frame and preferably will not create complementary regions that could produce
secondary mRNA structure. See, EP Patent Application Publication No. 75,444.

The deletions, insertions, and substitutions of the polypeptide sequences
encompassed herein are not expected to produce radical changes in the characteristics
of the polypeptide. However, when it is difficult to predict the exact effect of the
substitution, deletion, or insertion in advance of doing so, one skilled in the art will
appreciate that the effect will be evaluated by routine screening assays for
antimicrobial and/or fungicidal activity as referenced supra.
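By way of illustration only, the requirement that a mutation not place the coding sequence out of reading frame can be checked in silico before any screening assay is run. The sketch below makes two simplifying assumptions: that the net change in length is the only source of a frame shift, and that the standard stop codons apply. The toy coding sequences and function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def keeps_reading_frame(native_cds: str, variant_cds: str) -> bool:
    """True when the net insertion or deletion length is a multiple of three."""
    return (len(variant_cds) - len(native_cds)) % 3 == 0


def has_internal_stop(cds: str) -> bool:
    """True when a stop codon occurs in frame before the final complete codon."""
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - 2, 3)]
    return any(codon in STOP_CODONS for codon in codons[:-1])


# Hypothetical toy coding sequence: ATG GCT AAA TTT GGA TAA.
NATIVE = "ATGGCTAAATTTGGATAA"
IN_FRAME_DELETION = "ATGGCTTTTGGATAA"   # one whole codon (AAA) removed
FRAMESHIFT_DELETION = "ATGGCTAATTTGGATAA"  # a single nucleotide removed

if __name__ == "__main__":
    print(keeps_reading_frame(NATIVE, IN_FRAME_DELETION))
    print(keeps_reading_frame(NATIVE, FRAMESHIFT_DELETION))
```

A variant that passes both checks would still need to be confirmed by the activity assays referenced above; the checks only rule out gross disruptions of the open reading frame.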

Variant nucleotide sequences and polypeptides also encompass sequences and
polypeptides derived from a mutagenic and recombinogenic procedure such as DNA
shuffling. With such a procedure, one or more different coding sequences in the
nucleic acid molecules described in SEQ ID NO:1, 3, 5, 7, 9, 11, 13, 15, 21, 24,
27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87, 90, 93,
100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, or 126 can be
manipulated to create a new polypeptide possessing the desired properties. In this
manner, libraries of recombinant polynucleotides are generated from a population of
related sequence polynucleotides comprising sequence regions that have substantial
sequence identity and can be homologously recombined in vitro or in vivo. For
example, using this approach, sequence motifs encoding a domain of interest may be
shuffled between the nucleic acid molecules of the invention and other known
antimicrobial encoding nucleotide sequences to obtain a new nucleotide sequence
coding for a polypeptide with an improved property of interest, such as increased
antimicrobial and/or fungicidal properties at lower polypeptide concentrations or
specificity for particular plant pathogens. Examples include specificity for a particular
plant fungal pathogen including, but not limited to, pathogens such as M. grisea and
F. verticillioides. Strategies for such DNA shuffling are known in the art. See, for
example, Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751; Stemmer
(1994) Nature 370:389-391; Crameri et al. (1997) Nature Biotech. 15:436-438;
Moore et al. (1997) J. Mol. Biol. 272:336-347; Zhang et al. (1997) Proc. Natl. Acad.
Sci. USA 94:4504-4509; Crameri et al. (1998) Nature 391:288-291; and U.S. Patent
Nos. 5,605,793 and 5,837,458.

The nucleotide sequences of the invention can be used to isolate
corresponding sequences from other organisms, particularly other insects. In this
manner, methods such as PCR, hybridization, and the like can be used to identify such
sequences based on their sequence homology to the sequences set forth herein.
Sequences isolated based on their sequence identity to the full-length nucleotide
sequences set forth herein or to fragments thereof are encompassed by the present
invention. Such sequences include sequences that are orthologs of the disclosed
sequences. By "orthologs" is intended genes derived from a common ancestral gene
and which are found in different species as a result of speciation. Genes found in
different species are considered orthologs when their nucleotide sequences and/or
their encoded polypeptide sequences share substantial identity as defined elsewhere
herein. Functions of orthologs are often highly conserved among species. Thus,
isolated sequences that encode an antimicrobial protein and which hybridize under
stringent conditions to the nucleotide sequences disclosed herein, or to fragments
thereof, are encompassed by the present invention.

In a PCR approach, oligonucleotide primers can be designed for use in PCR
reactions to amplify corresponding DNA sequences from cDNA or genomic DNA
extracted from any insect of interest. Methods for designing PCR primers and PCR
cloning are generally known in the art and are disclosed in Sambrook et al. (1989)
Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory
Press, Plainview, New York). See also Innis et al., eds. (1990) PCR Protocols: A
Guide to Methods and Applications (Academic Press, New York); Innis and Gelfand,
eds. (1995) PCR Strategies (Academic Press, New York); and Innis and Gelfand, eds.
(1999) PCR Methods Manual (Academic Press, New York). Known methods of PCR
include, but are not limited to, methods using paired primers, nested primers, single
specific primers, degenerate primers, gene-specific primers, vector-specific primers,
partially-mismatched primers, and the like.
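By way of illustration only, when degenerate primers are designed from a peptide sequence, the number of distinct oligonucleotides needed to cover every possible coding sequence (the degeneracy) is the product of the number of codons for each residue. The sketch below assumes the standard genetic code and full degeneracy at every position; the peptide and the function names are hypothetical and the approach is a common heuristic, not a method stated in this specification.

```python
from collections import Counter
from typing import Tuple

# Standard genetic code amino acids in TCAG codon order; "*" marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS_PER_AMINO_ACID = Counter(AMINO_ACIDS)


def primer_degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences that can encode the peptide."""
    degeneracy = 1
    for residue in peptide.upper():
        if residue == "*" or residue not in CODONS_PER_AMINO_ACID:
            raise ValueError(f"not a standard amino acid: {residue!r}")
        degeneracy *= CODONS_PER_AMINO_ACID[residue]
    return degeneracy


def least_degenerate_window(peptide: str, width: int) -> Tuple[str, int]:
    """Return the peptide window of the given width with the lowest degeneracy."""
    windows = [peptide[i:i + width] for i in range(len(peptide) - width + 1)]
    best = min(windows, key=primer_degeneracy)  # first window wins on ties
    return best, primer_degeneracy(best)


if __name__ == "__main__":
    # Hypothetical peptide used only to exercise the functions.
    print(least_degenerate_window("LSRMWKDELS", 5))
```

Windows rich in methionine and tryptophan (one codon each) give the lowest degeneracy, whereas leucine, serine, and arginine (six codons each) raise it sharply.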

In hybridization techniques, all or part of a known nucleotide sequence is used
as a probe that selectively hybridizes to other corresponding nucleotide sequences
present in a population of cloned genomic DNA fragments or cDNA fragments (i.e.,
genomic or cDNA libraries) from a chosen organism. The hybridization probes may
be genomic DNA fragments, cDNA fragments, RNA fragments, or other
oligonucleotides, and may be labeled with a detectable group such as 32P, or any other
detectable marker. Thus, for example, probes for hybridization can be made by
labeling synthetic oligonucleotides based on the disease resistant sequences of the
invention. Methods for preparation of probes for hybridization and for construction of
cDNA and genomic libraries are generally known in the art and are disclosed in
Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold
Spring Harbor Laboratory Press, Plainview, New York).

For example, an entire nucleotide sequence disclosed herein, or one or more
portions thereof, may be used as a probe capable of specifically hybridizing to the
corresponding nucleotide sequences and messenger RNAs. To achieve specific
hybridization under a variety of conditions, such probes include sequences that are
unique among the nucleotide sequences of the invention and are preferably at least
about 10 nucleotides in length, and most preferably at least about 20 nucleotides in
length. Such probes may be used to amplify corresponding sequences from a chosen
organism by PCR. This technique may be used to isolate additional coding sequences
from a desired organism or as a diagnostic assay to determine the presence of coding
sequences in an organism. Hybridization techniques include hybridization screening
of plated DNA libraries (either plaques or colonies; see, for example, Sambrook et al.
(1989) Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor
Laboratory Press, Plainview, New York)).

Hybridization of such sequences may be carried out under stringent
conditions. By "stringent conditions" or "stringent hybridization conditions" is
intended conditions under which a probe will hybridize to its target sequence to a
detectably greater degree than to other sequences (e.g., at least 2-fold over
background). Stringent conditions are sequence-dependent and will be different in
different circumstances. By controlling the stringency of the hybridization and/or
washing conditions, target sequences that are 100% complementary to the probe can
be identified (homologous probing). Alternatively, stringency conditions can be
adjusted to allow some mismatching in sequences so that lower degrees of similarity
are detected (heterologous probing). Generally, a probe is less than about 1000
nucleotides in length, preferably less than 500 nucleotides in length.

Thus, isolated sequences that encode an antimicrobial polypeptide and
which hybridize under stringent conditions to a sequence disclosed herein, or to
fragments thereof, are encompassed by the present invention.

Typically, stringent conditions will be those in which the salt concentration is
less than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or
other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes
(e.g., 10 to 50 nucleotides) and at least about 60°C for long probes (e.g., greater than
50 nucleotides). Stringent conditions may also be achieved with the addition of
destabilizing agents such as formamide. Exemplary low stringency conditions
include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCl, 1%
SDS (sodium dodecyl sulphate) at 37°C, and a wash in 1X to 2X SSC (20X SSC = 3.0
M NaCl/0.3 M trisodium citrate) at 50 to 55°C. Exemplary moderate stringency
conditions include hybridization in 40 to 45% formamide, 1.0 M NaCl, 1% SDS at
37°C, and a wash in 0.5X to 1X SSC at 55 to 60°C. Exemplary high stringency
conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37°C, and
a wash in 0.1X SSC at 60 to 65°C. Duration of hybridization is generally less than
about 24 hours, usually about 4 to about 12 hours.
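By way of illustration only, the sodium ion concentration of an SSC wash at a given dilution can be estimated from the 20X composition stated above. The sketch assumes that trisodium citrate contributes three sodium ions per formula unit and that both salts dissociate fully; the function name is hypothetical.

```python
# 20X SSC composition as stated in the text above.
NACL_MOLAR_AT_20X = 3.0
TRISODIUM_CITRATE_MOLAR_AT_20X = 0.3
SODIUM_IONS_PER_CITRATE = 3  # assumption: full dissociation of trisodium citrate


def sodium_molarity(ssc_strength: float) -> float:
    """Approximate Na+ molarity of an SSC solution, e.g. 0.1 for 0.1X SSC."""
    sodium_at_20x = (
        NACL_MOLAR_AT_20X + SODIUM_IONS_PER_CITRATE * TRISODIUM_CITRATE_MOLAR_AT_20X
    )
    return sodium_at_20x * ssc_strength / 20.0


if __name__ == "__main__":
    for strength in (2.0, 1.0, 0.5, 0.1):
        print(f"{strength}X SSC: about {sodium_molarity(strength):.4f} M Na+")
```

Under these assumptions 1X SSC corresponds to about 0.195 M sodium ion, so the low, moderate, and high stringency washes listed above span roughly 0.39 M down to about 0.02 M.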

Specificity is typically the function of post-hybridization washes, the critical
factors being the ionic strength and temperature of the final wash solution. For DNA-DNA
hybrids, the Tm can be approximated from the equation of Meinkoth and Wahl
(1984) Anal. Biochem. 138:267-284: Tm = 81.5°C + 16.6 (log M) + 0.41 (%GC) -
0.61 (% form) - 500/L; where M is the molarity of monovalent cations, %GC is the
percentage of guanosine and cytosine nucleotides in the DNA, % form is the
percentage of formamide in the hybridization solution, and L is the length of the
hybrid in base pairs. The Tm is the temperature (under defined ionic strength and pH)
at which 50% of a complementary target sequence hybridizes to a perfectly matched
probe. Tm is reduced by about 1°C for each 1% of mismatching; thus, Tm,
hybridization, and/or wash conditions can be adjusted to hybridize to sequences of the
desired identity. For example, if sequences with >90% identity are sought, the Tm can
be decreased 10°C. Generally, stringent conditions are selected to be about 5°C lower
than the thermal melting point (Tm) for the specific sequence and its complement at a
defined ionic strength and pH. However, severely stringent conditions can utilize a
hybridization and/or wash at 1, 2, 3, or 4°C lower than the thermal melting point (Tm);
moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or
10°C lower than the thermal melting point (Tm); low stringency conditions can utilize
a hybridization and/or wash at 11, 12, 13, 14, 15, or 20°C lower than the thermal
melting point (Tm). Using the equation, hybridization and wash compositions, and
desired Tm, those of ordinary skill will understand that variations in the stringency of
hybridization and/or wash solutions are inherently described. If the desired degree of
mismatching results in a Tm of less than 45°C (aqueous solution) or 32°C (formamide
solution), it is preferred to increase the SSC concentration so that a higher temperature
can be used. An extensive guide to the hybridization of nucleic acids is found in
Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology — 
Hybridization with Nucleic Acid Probes, Part I, Chapter 2 (Elsevier, New York); and 
Ausubel et al., eds. (1995) Current Protocols in Molecular Biology, Chapter 2 

(Greene Publishing and Wiley-Interscience, New York). See Sambrook et al. (1989)
Molecular Cloning: A Laboratory Manual (2d ed., Cold Spring Harbor Laboratory 
Press, Plainview, New York). 
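By way of illustration only, the Meinkoth and Wahl approximation and the adjustment of about 1°C per 1% of mismatching described above can be evaluated numerically. The sketch assumes that "log" denotes the base-10 logarithm; the function names and the example input values are hypothetical.

```python
import math


def melting_temperature(
    monovalent_cation_molar: float,
    gc_percent: float,
    formamide_percent: float,
    hybrid_length_bp: int,
) -> float:
    """Tm (degrees C) = 81.5 + 16.6 log10(M) + 0.41 (%GC) - 0.61 (% form) - 500/L."""
    return (
        81.5
        + 16.6 * math.log10(monovalent_cation_molar)
        + 0.41 * gc_percent
        - 0.61 * formamide_percent
        - 500.0 / hybrid_length_bp
    )


def mismatch_adjusted_tm(tm_perfect: float, percent_identity: float) -> float:
    """Lower Tm by about 1 degree C for each 1% of mismatching."""
    return tm_perfect - (100.0 - percent_identity)


if __name__ == "__main__":
    # Hypothetical example: 1 M monovalent cation, 50% GC, 50% formamide, 500 bp hybrid.
    tm = melting_temperature(1.0, 50.0, 50.0, 500)
    print(f"Tm for a perfect match: {tm:.1f} C")
    print(f"Tm when 90% identity is sought: {mismatch_adjusted_tm(tm, 90.0):.1f} C")
```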

The following terms are used to describe the sequence relationships between
two or more nucleic acids or polynucleotides: (a) "reference sequence," (b)
"comparison window," (c) "sequence identity," (d) "percentage of sequence identity,"
and (e) "substantial identity."

(a) As used herein, "reference sequence" is a defined sequence used as a
basis for sequence comparison. A reference sequence may be a subset or the entirety
of a specified sequence; for example, a segment of a full-length cDNA or gene
sequence, or the complete cDNA or gene sequence.

(b) As used herein, "comparison window" makes reference to a contiguous 
and specified segment of a polynucleotide sequence, wherein the polynucleotide 
sequence in the comparison window may comprise additions or deletions (i.e., gaps) 
compared to the reference sequence (which does not comprise additions or deletions)
for optimal alignment of the two sequences. Generally, the comparison window is at
least 20 contiguous nucleotides in length, and optionally can be 30, 40, 50, 100, or
longer. Those of skill in the art understand that, to avoid a high similarity to a
reference sequence due to inclusion of gaps in the polynucleotide sequence, a gap
penalty is typically introduced and is subtracted from the number of matches.

Methods of alignment of sequences for comparison are well known in the art.
Thus, the determination of percent identity between any two sequences can be
accomplished using a mathematical algorithm. Non-limiting examples of such
mathematical algorithms are the algorithm of Myers and Miller (1988) CABIOS 4:11-17;
the local homology algorithm of Smith et al. (1981) Adv. Appl. Math. 2:482; the
homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48:443-453; the search-for-similarity-method of Pearson and Lipman (1988) Proc.
Natl. Acad. Sci. 85:2444-2448; the algorithm of Karlin and Altschul (1990) Proc.
Natl. Acad. Sci. USA 87:2264, modified as in Karlin and Altschul (1993) Proc. Natl.
Acad. Sci. USA 90:5873-5877.

Computer implementations of these mathematical algorithms can be utilized 
for comparison of sequences to determine sequence identity. Such implementations 
include, but are not limited to: CLUSTAL in the PC/Gene program (available from
Intelligenetics, Mountain View, California); the ALIGN program (Version 2.0); the
ALIGN PLUS program (version 3.0, copyright 1997); and GAP, BESTFIT, BLAST,
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8
(available from Genetics Computer Group (GCG), 575 Science Drive, Madison,
Wisconsin, USA). Alignments using these programs can be performed using the
default parameters. The CLUSTAL program is well described by Higgins et al.
(1988) Gene 73:237-244; Higgins et al. (1989) CABIOS 5:151-153; Corpet et
al. (1988) Nucleic Acids Res. 16:10881-90; Huang et al. (1992) CABIOS 8:155-65;
and Pearson et al. (1994) Meth. Mol. Biol. 24:307-331. The ALIGN and the ALIGN
PLUS programs are based on the algorithm of Myers and Miller (1988) supra. A
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can
be used with the ALIGN program when comparing amino acid sequences.

The BLAST programs of Altschul et al. (1990) J. Mol. Biol. 215:403 are based
on the algorithm of Karlin and Altschul (1990) supra. The BLAST family of
programs that can be used for database similarity searches includes: BLASTN for
nucleotide query sequences against nucleotide database sequences; BLASTP for
peptide query sequences against a peptide database; BLASTX for nucleotide query
sequences against protein database sequences; TBLASTN for protein query sequences
against nucleotide database sequences; and TBLASTX for nucleotide query
sequences against nucleotide databases with the translation of all nucleotide
sequences to protein. BLAST nucleotide searches can be performed with the
BLASTN program, score = 100, wordlength = 12, to obtain nucleotide sequences
homologous to a nucleotide sequence encoding a polypeptide of the invention.
BLAST protein searches can be performed with the BLASTX program, score = 50,
wordlength = 3, to obtain amino acid sequences homologous to a protein or
polypeptide of the invention. To obtain gapped alignments for comparison purposes,
Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul et al. (1997)
Nucleic Acids Res. 25:3389. Alternatively, PSI-BLAST (in BLAST 2.0) can be used
to perform an iterated search that detects distant relationships between molecules. See
Altschul et al. (1997) supra. When utilizing BLAST, Gapped BLAST, or PSI-BLAST,
the default parameters of the respective programs (e.g., BLASTN for nucleotide
sequences, BLASTX for proteins) can be used. See www.ncbi.nlm.nih.gov.
Alignment may also be performed manually by inspection.

Unless otherwise stated, sequence identity/similarity values provided herein
refer to the value obtained using GAP Version 10 using the following parameters: %
identity using GAP Weight of 50 and Length Weight of 3; % similarity using Gap
Weight of 12 and Length Weight of 4, or any equivalent program. By "equivalent
program" is intended any sequence comparison program that, for any two sequences
in question, generates an alignment having identical nucleotide or amino acid residue
matches and an identical percent sequence identity when compared to the
corresponding alignment generated by the preferred program.

GAP uses the algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48:443-453, to find the alignment of two complete sequences that maximizes the
number of matches and minimizes the number of gaps. GAP considers all possible
alignments and gap positions and creates the alignment with the largest number of
matched bases and the fewest gaps. It allows for the provision of a gap creation
penalty and a gap extension penalty in units of matched bases. GAP must make a
profit of gap creation penalty number of matches for each gap it inserts. If a gap
extension penalty greater than zero is chosen, GAP must, in addition, make a profit
for each gap inserted of the length of the gap times the gap extension penalty. Default
gap creation penalty values and gap extension penalty values in Version 10 of the
Wisconsin Genetics Software Package for protein sequences are 8 and 2, respectively.
For nucleotide sequences the default gap creation penalty is 50 while the default gap
extension penalty is 3. The gap creation and gap extension penalties can be expressed
as an integer selected from the group of integers consisting of from 0 to 200. Thus,
for example, the gap creation and gap extension penalties can be 0, 1, 2, 3, 4, 5, 6, 7,
8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65 or greater.

GAP presents one member of the family of best alignments. There may be
many members of this family, but no other member has a better quality. GAP
displays four figures of merit for alignments: Quality, Ratio, Identity, and Similarity.
The Quality is the metric maximized in order to align the sequences. Ratio is the
quality divided by the number of bases in the shorter segment. Percent Identity is the
percent of the symbols that actually match. Percent Similarity is the percent of the
symbols that are similar. Symbols that are across from gaps are ignored. A similarity
is scored when the scoring matrix value for a pair of symbols is greater than or equal
to 0.50, the similarity threshold. The scoring matrix used in Version 10 of the
Wisconsin Genetics Software Package is BLOSUM62 (see Henikoff and Henikoff
(1989) Proc. Natl. Acad. Sci. USA 89:10915).
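The gap accounting described above, in which each gap costs the creation penalty plus its length times the extension penalty, can be illustrated with a small global-alignment scorer. The following is a minimal sketch and not the GAP program itself: the function name, the simple match/mismatch scores used in place of a scoring matrix, and the choice to penalize end gaps are all assumptions made for illustration.

```python
NEG_INF = float("-inf")


def global_alignment_score(a, b, match, mismatch, gap_creation, gap_extension):
    """Best global alignment score; a gap of length k costs creation + k * extension."""
    n, m = len(a), len(b)
    # M: a[i-1] aligned to b[j-1]; X: a[i-1] aligned to a gap; Y: b[j-1] aligned to a gap.
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = -(gap_creation + i * gap_extension)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_creation + j * gap_extension)
    open_cost = gap_creation + gap_extension  # cost of the first position of a new gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1]) + s
            X[i][j] = max(M[i - 1][j] - open_cost,
                          Y[i - 1][j] - open_cost,
                          X[i - 1][j] - gap_extension)
            Y[i][j] = max(M[i][j - 1] - open_cost,
                          X[i][j - 1] - open_cost,
                          Y[i][j - 1] - gap_extension)
    return max(M[n][m], X[n][m], Y[n][m])
```

With a match score of 1, a mismatch score of -1, a gap creation penalty of 2, and a gap extension penalty of 1, aligning ACGT with AGT yields three matches and one single-position gap, so the three matches exactly pay for the gap cost of 3.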

For purposes of the present invention, comparison of nucleotide or
polypeptide sequences for determination of percent sequence identity to the
nucleotide or polypeptide sequences disclosed herein is preferably made using the
Clustal W program (Version 1.7 or later) with its default parameters or any equivalent
program. By "equivalent program" is intended any sequence comparison program
that, for any two sequences in question, generates an alignment having identical
nucleotide or amino acid residue matches and an identical percent sequence identity
when compared to the corresponding alignment generated by the preferred program.

(c) As used herein, "sequence identity" or "identity" in the context of two
nucleic acid or polypeptide sequences makes reference to the residues in the two
sequences that are the same when aligned for maximum correspondence over a
specified comparison window. When percentage of sequence identity is used in
reference to proteins it is recognized that residue positions which are not identical
often differ by conservative amino acid substitutions, where amino acid residues are
substituted for other amino acid residues with similar chemical properties (e.g., charge
or hydrophobicity) and therefore do not change the functional properties of the
molecule. When sequences differ in conservative substitutions, the percent sequence
identity may be adjusted upwards to correct for the conservative nature of the
substitution. Sequences that differ by such conservative substitutions are said to have
"sequence similarity" or "similarity". Means for making this adjustment are well
known to those of skill in the art. Typically this involves scoring a conservative
substitution as a partial rather than a full mismatch, thereby increasing the percentage
sequence identity. Thus, for example, where an identical amino acid is given a score
of 1 and a non-conservative substitution is given a score of zero, a conservative
substitution is given a score between zero and 1. The scoring of conservative
substitutions is calculated, e.g., as implemented in the program PC/GENE
(Intelligenetics, Mountain View, California).
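The partial-credit scoring just described can be sketched as follows. This is an illustration under stated assumptions and not the PC/GENE implementation: the residue groupings, the partial score of 0.5, and the function names are hypothetical choices, and the two sequences are assumed to be already aligned with "-" marking gaps.

```python
# Hypothetical groupings of chemically similar residues (illustrative only).
CONSERVATIVE_GROUPS = [
    set("ILVM"), set("FYW"), set("KRH"), set("DE"), set("ST"), set("NQ"),
]


def residue_score(x, y, partial=0.5):
    """1 for identity, `partial` for a conservative substitution, 0 otherwise."""
    if x == y:
        return 1.0
    for group in CONSERVATIVE_GROUPS:
        if x in group and y in group:
            return partial
    return 0.0


def percent_similarity(aligned_a, aligned_b, partial=0.5):
    """Percent score over all aligned columns; gap columns contribute zero."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned, non-empty, and equal in length")
    total = sum(
        residue_score(x, y, partial)
        for x, y in zip(aligned_a, aligned_b)
        if x != "-" and y != "-"
    )
    return 100.0 * total / len(aligned_a)
```

For the aligned peptides KLDE and RLDA, the K/R column earns partial credit, two columns are identical, and E/A scores zero, so the adjusted value is higher than the plain identity of 50%.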

(d) As used herein, "percentage of sequence identity" means the value 
determined by comparing two optimally aligned sequences over a comparison 
window, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) as compared to the reference
sequence (which does not comprise additions or deletions) for optimal alignment of
the two sequences. The percentage is calculated by determining the number of
positions at which the identical nucleic acid base or amino acid residue occurs in both
sequences to yield the number of matched positions, dividing the number of matched
positions by the total number of positions in the window of comparison, and
multiplying the result by 100 to yield the percentage of sequence identity.
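The calculation in definition (d) amounts to the following short sketch. The function name is hypothetical, the inputs are assumed to be two already-aligned strings of equal length covering the comparison window, and "-" is assumed to mark a gap position.

```python
def percent_identity(aligned_a, aligned_b):
    """Matched positions divided by window length, multiplied by 100."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned, non-empty, and equal in length")
    # A position matches when both sequences carry the same residue (not a gap).
    matched = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matched / len(aligned_a)
```

For the aligned pair ACGT-A and ACCTTA, four of the six window positions match, giving roughly 66.7% sequence identity.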

(e)(i) The term "substantial identity" of polynucleotide sequences means that
a polynucleotide comprises a sequence that has at least 70%, 80%, 90%, 95%, or
more sequence identity compared to a reference sequence using one of the alignment
programs described using standard parameters. One of skill in the art will recognize
that these values can be appropriately adjusted to determine corresponding identity of
polypeptides encoded by two nucleotide sequences by taking into account codon
degeneracy, amino acid similarity, reading frame positioning, and the like.
Substantial identity of amino acid sequences for these purposes normally means
sequence identity of at least 60%, more preferably at least 70%, 80%, 90%, or 95%.

Another indication that nucleotide sequences are substantially identical is if
two molecules hybridize to each other under stringent conditions. Generally,
stringent conditions are selected to be about 5°C lower than the thermal melting point
(Tm) for the specific sequence at a defined ionic strength and pH. However, stringent
conditions encompass temperatures in the range of about 1°C to about 20°C,
depending upon the desired degree of stringency as otherwise qualified herein.
Nucleic acids that do not hybridize to each other under stringent conditions are still
substantially identical if the polypeptides they encode are substantially identical. This
may occur, e.g., when a copy of a nucleic acid is created using the maximum codon
degeneracy permitted by the genetic code. One indication that two nucleic acid
sequences are substantially identical is when the polypeptide encoded by the first
nucleic acid is immunologically cross reactive with the polypeptide encoded by the
second nucleic acid.

(e)(ii) The term "substantial identity" in the context of a peptide indicates that
a peptide comprises a sequence with at least 70%, 80%, 85%, 90%, or 95% sequence
identity to the reference sequence over a specified comparison window. Preferably,
optimal alignment is conducted using the homology alignment algorithm of
Needleman and Wunsch (1970) J. Mol. Biol. 48:443-453. An indication that two
peptide sequences are substantially identical is that one peptide is immunologically
reactive with antibodies raised against the second peptide. Thus, a peptide is
substantially identical to a second peptide, for example, where the two peptides differ
only by a conservative substitution. Peptides that are "substantially similar" share
sequences as noted above except that residue positions that are not identical may
differ by conservative amino acid changes.

The nucleic acid sequences of the present invention can be expressed in a host 
cell such as bacteria, fungi, yeast, insect, mammalian, or plant cells. It is expected 
that those of skill in the art are knowledgeable in the numerous expression systems 
available for expression of a nucleic acid encoding a polypeptide of the present 
invention. No attempt to describe in detail the various methods known for the
expression of polypeptides in prokaryotes or eukaryotes will be made. 

As used herein, "heterologous" in reference to a nucleic acid is a nucleic acid 
that originates from a foreign species, or, if from the same species, is substantially 
modified from its native form in composition and/or genomic locus by deliberate 
human intervention. For example, a promoter operably linked to a heterologous
structural gene is from a species different from that from which the structural gene 
was derived, or, if from the same species, one or both are substantially modified from 
their original form. A heterologous protein may originate from a foreign species, or, 
if from the same species, is substantially modified from its original form by deliberate 
human intervention.

By "host cell" is meant a cell that comprises a heterologous nucleic acid
sequence of the invention. Host cells may be prokaryotic cells such as E. coli, or
eukaryotic cells such as yeast, insect, amphibian, or mammalian cells. Preferably,
host cells are monocotyledonous or dicotyledonous plant cells, particularly rice and
maize plant cells.

The disease resistance-conferring sequences of the invention are provided in 
expression cassettes or DNA constructs for expression in the plant of interest. The 
cassette will include 5' and 3' regulatory sequences operably linked to a nucleotide
sequence of the invention. By "operably linked" is intended a functional linkage 
between a promoter and a second sequence, wherein the promoter sequence initiates 
and mediates transcription of the nucleotide sequence corresponding to the second 
sequence. Generally, operably linked means that the nucleic acid sequences being
linked are contiguous and, where necessary to join two protein coding regions,
contiguous and in the same reading frame. The cassette may additionally contain at
least one additional gene to be cotransformed into the organism. Alternatively, the 
additional gene(s) can be provided on multiple expression cassettes. 

Such an expression cassette is provided with a plurality of restriction sites for
insertion of the disease resistant sequence to be under the transcriptional regulation of
the regulatory regions. The expression cassette may additionally contain selectable 
marker genes. 

The expression cassette will include, in the 5'-3' direction of transcription, a
transcriptional and translational initiation region, a signal peptide sequence, a disease
resistant DNA sequence of the invention, and a transcriptional and translational
termination region functional in plants. The transcriptional initiation region, the 
promoter, may be native or analogous or foreign or heterologous to the plant host. 
Additionally, the promoter may be the natural sequence or alternatively a synthetic 
sequence. By "foreign" is intended that the transcriptional initiation region is not 
found in the native plant into which the transcriptional initiation region is introduced.
As used herein, a chimeric gene comprises a coding sequence operably linked to a 
transcription initiation region that is heterologous to the coding sequence. 

While it may be preferable to express the sequences using heterologous 
promoters, the native promoter sequences may be used. Such constructs would vary 
expression levels of the disease resistant RNA/protein in the plant or plant cell. Thus,
the phenotype of the plant or plant cell is altered. 

The termination region may be native with the transcriptional initiation region, 
may be native with the operably linked DNA sequence of interest, or may be derived 
from another source. Convenient termination regions are available from the Ti-plasmid
of A. tumefaciens, such as the octopine synthase and nopaline synthase
termination regions. See also Guerineau et al. (1991) Mol. Gen. Genet. 262:141-144;
Proudfoot (1991) Cell 64:671-674; Sanfacon et al. (1991) Genes Dev. 5:141-149;
Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. (1990) Gene 91:151-158;
Ballas et al. (1989) Nucleic Acids Res. 17:7891-7903; and Joshi et al. (1987) Nucleic
Acid Res. 15:9627-9639.

Where appropriate, the nucleotide sequences may be optimized for increased
expression in the transformed host. That is, the nucleotide sequences can be
synthesized using plant-preferred codons for improved expression in plants. Methods
are available in the art for synthesizing plant-preferred nucleotide sequences or genes.
See, for example, U.S. Patent Nos. 5,380,831, and 5,436,391, and Murray et al.
(1989) Nucleic Acids Res. 17:477-498, herein incorporated by reference. Nucleotide
sequences have been created that encode Fus1 and Fus2 operably linked to BAA and
codon biased for expression in host cells. The BAA-Fus1 nucleotide sequence was
codon-biased according to M. sexta codon usage. The BAA-Fus2 nucleotide
sequence was codon-biased according to Streptomyces coelicolor codon usage. S.
coelicolor codon usage patterns resemble the codon usage patterns of many plants.
The development of the codon-biased sequences is described elsewhere herein.
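Codon biasing of the kind described above can be pictured as a back-translation that picks one preferred codon per amino acid. The sketch below is illustrative only: the function name is hypothetical, and the small codon table is an assumed example, not the M. sexta, S. coelicolor, or any plant usage table referred to in the text. Each listed codon does encode the stated residue under the standard genetic code.

```python
# Assumed, illustrative preferred-codon table (one codon per residue; "*" is stop).
# A real table would be derived from codon usage data for the intended host.
PREFERRED_CODONS = {
    "M": "ATG",
    "K": "AAG",
    "L": "CTG",
    "S": "TCC",
    "*": "TGA",
}


def back_translate(peptide, codon_table=PREFERRED_CODONS):
    """Return a coding sequence that uses the preferred codon for each residue."""
    try:
        return "".join(codon_table[residue] for residue in peptide)
    except KeyError as missing:
        raise ValueError(f"no preferred codon listed for residue {missing}") from None
```

Calling back_translate("MKLS*") with this table returns ATGAAGCTGTCCTGA; swapping in a different host's table changes the nucleotide sequence without changing the encoded peptide.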

Additional sequence modifications are known to enhance gene expression in a
cellular host. These include elimination of sequences encoding spurious
polyadenylation signals, exon-intron splice site signals, transposon-like repeats, and
other such well-characterized sequences that may be deleterious to gene expression.
The G-C content of the sequence may be adjusted to levels average for a given
cellular host, as calculated by reference to known genes expressed in the host cell.
When possible, the sequence is modified to avoid predicted hairpin secondary mRNA
structures.
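Two of the checks mentioned above, G-C content and the presence of a motif that might act as a spurious signal, can be sketched briefly. The function names are hypothetical, and the default motif AATAAA is used only as an illustrative assumption for a polyadenylation-like signal; the text does not specify which motifs are screened.

```python
def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def find_motif(seq, motif="AATAAA"):
    """Return the 0-based start positions of every occurrence of `motif`."""
    seq = seq.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)  # step by one to catch overlaps
    return positions
```

A candidate coding sequence could be screened this way, with its G-C fraction compared against the average for genes known to be expressed in the intended host.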

In certain embodiments of the invention, it is desirable to utilize the mature 
peptide or the nucleotide sequence encoding the mature peptide. Within the cell, 
proteolytic modifications of amino acid sequences occur frequently. The proteolytic
event removes amino acids from the precursor polypeptide to yield a mature peptide. 
The proteolytic processing can be highly sequence specific. Often the precursor 
peptides are inactive while the mature peptides possess the desired activity. Thus,
isolation of a peptide based on its activity results in isolation of the active, mature
peptide. Discovery of the existence of pre-sequences occurs when the nucleotide 
sequence encoding the mature peptide is identified. The open reading frame that 
encodes the mature peptide also encodes the pre-sequences that were removed by the
cell. Proteolytic maturation of amino acid sequences occurs in multiple cellular
locations including, but not limited to, the endoplasmic reticulum, the cytoplasm, the
mitochondria, the chloroplasts, the nucleus, the Golgi apparatus, and the extracellular
matrix. Proteolytic processing of peptides is discussed in Creighton, T. E. (1993)
Proteins: Structures & Molecular Properties. W. H. Freeman & Co., U.S.A.; and
Alberts et al., eds. (1994) Molecular Biology of the Cell. Garland Publishing, Inc., New
York, herein incorporated by reference. Rather than rely on a host cell to properly 
process the polypeptide of the invention, employment of a nucleotide sequence 
encoding the mature peptide may be desirable. 

The expression cassettes may additionally contain 5' leader sequences in the
expression cassette construct. Such leader sequences can act to enhance translation.
Translation leaders are known in the art and include: picornavirus leaders, for
example, EMCV leader (Encephalomyocarditis 5' noncoding region) (Elroy-Stein et
al. (1989) PNAS USA 86:6126-6130); potyvirus leaders, for example, TEV leader
(Tobacco Etch Virus) (Allison et al. (1986); MDMV leader (Maize Dwarf Mosaic
Virus); Virology 154:9-20), and human immunoglobulin heavy-chain binding protein
(BiP), (Macejak et al. (1991) Nature 353:90-94); untranslated leader from the coat
protein mRNA of alfalfa mosaic virus (AMV RNA 4) (Jobling et al. (1987) Nature
325:622-625); tobacco mosaic virus leader (TMV) (Gallie et al. (1989) in Molecular
Biology of RNA, ed. Cech (Liss, New York), pp. 237-256); and maize chlorotic mottle
virus leader (MCMV) (Lommel et al. (1991) Virology 81:382-385). See also,
Della-Cioppa et al. (1987) Plant Physiol. 84:965-968. Other methods known to
enhance translation can also be utilized, for example, introns, and the like.

Signal peptides may be fused to the disease resistant nucleotide sequence of
the invention to direct transport of the expressed gene product out of the cell to the
desired site of action in the intercellular space. Examples of signal peptides include
those natively linked to the Barley alpha amylase protein (BAA), sporamin,
oryzacystatin-I, and those from the plant pathogenesis-related proteins, e.g., PR-1,
PR-2, etc.

In preparing the expression cassette, the various DNA fragments may be
manipulated, so as to provide for the DNA sequences in the proper orientation and, as
appropriate, in the proper reading frame. Toward this end, adapters or linkers may be
employed to join the DNA fragments or other manipulations may be involved to
provide for convenient restriction sites, removal of superfluous DNA, removal of
restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair,
restriction, annealing, resubstitutions, e.g., transitions and transversions, may be
involved.

Generally, the expression cassette will comprise a selectable marker gene for the
selection of transformed cells. Selectable marker genes are utilized for the selection of
transformed cells or tissues. Marker genes include genes encoding antibiotic resistance,
such as those encoding neomycin phosphotransferase II (NEO) and hygromycin
phosphotransferase (HPT), as well as genes conferring resistance to herbicidal
compounds, such as glufosinate ammonium, bromoxynil, imidazolinones,
2,4-dichlorophenoxyacetate (2,4-D), and sulfonylureas (SUs). See generally, Yarranton
(1992) Curr. Opin. Biotech. 3:506-511; Christopherson et al. (1992) Proc. Natl. Acad.
Sci. USA 89:6314-6318; Yao et al. (1992) Cell 71:63-72; Reznikoff (1992) Mol.
Microbiol. 6:2419-2422; Barkley et al. (1980) in The Operon, pp. 177-220; Hu et al.
(1987) Cell 48:555-566; Brown et al. (1987) Cell 49:603-612; Figge et al. (1988) Cell
52:713-722; Deuschle et al. (1989) Proc. Natl. Acad. Sci. USA 86:5400-5404; Fuerst et
al. (1989) Proc. Natl. Acad. Sci. USA 86:2549-2553; Deuschle et al. (1990) Science
248:480-483; Gossen (1993) Ph.D. Thesis, University of Heidelberg; Reines et al.
(1993) Proc. Natl. Acad. Sci. USA 90:1917-1921; Labow et al. (1990) Mol. Cell. Biol.
10:3343-3356; Zambretti et al. (1992) Proc. Natl. Acad. Sci. USA 89:3952-3956; Baim
et al. (1991) Proc. Natl. Acad. Sci. USA 88:5072-5076; Wyborski et al. (1991) Nucleic
Acids Res. 19:4647-4653; Hillen and Wissman (1989) Topics Mol. Struc. Biol. 10:143-162;
Degenkolb et al. (1991) Antimicrob. Agents Chemother. 35:1591-1595;
Kleinschnidt et al. (1988) Biochemistry 27:1094-1104; Bonin (1993) Ph.D. Thesis,
University of Heidelberg; Gossen et al. (1992) Proc. Natl. Acad. Sci. USA 89:5547-5551;
Oliva et al. (1992) Antimicrob. Agents Chemother. 36:913-919; Hlavka et al.
(1985) Handbook of Experimental Pharmacology, Vol. 78 (Springer-Verlag, Berlin);
Gill et al. (1988) Nature 334:721-724. Such disclosures are herein incorporated by
reference.

The above list of selectable marker genes is not meant to be limiting. Any
selectable marker gene can be used in the present invention.

A number of promoters can be used in the practice of the invention. The
promoters can be selected based on the desired outcome. That is, the nucleic acids
can be combined with constitutive, tissue-preferred, inducible or other promoters for
expression in plants. Such constitutive promoters include, for example, the core
promoter of the Rsyn7 promoter and other constitutive promoters disclosed in WO
99/43838 and U.S. Patent No. 6,072,050; Scp1 promoter (U.S. Patent 6,072,050), rice
actin (McElroy et al. (1990) Plant Cell 2:163-171); ubiquitin (Christensen et al.
(1989) Plant Mol. Biol. 12:619-632 and Christensen et al. (1992) Plant Mol. Biol.
18:675-689); pEMU (Last et al. (1991) Theor. Appl. Genet. 81:581-588); MAS
(Velten et al. (1984) EMBO J. 3:2723-2730); ALS promoter (U.S. Patent No.
5,659,026), Maize h2B (PCT Application Serial No. WO 99/43797) and the like.
Other constitutive promoters include, for example, U.S. Patent Nos. 5,608,149;
5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142; and
6,177,611.

Generally, it will be beneficial to express the gene from an inducible promoter,
particularly from a pathogen-inducible promoter. Such promoters include those from
pathogenesis-related proteins (PR proteins), which are induced following infection by
a pathogen; e.g., PR proteins, SAR proteins, beta-1,3-glucanase, chitinase, etc. See,
for example, Redolfi et al. (1983) Neth. J. Plant Pathol. 89:245-254; Uknes et al.
(1992) Plant Cell 4:645-656; and Van Loon (1985) Plant Mol. Virol. 4:111-116. See
also WO 99/43819, herein incorporated by reference.

Of interest are promoters that are expressed locally at or near the site of
pathogen infection. See, for example, Marineau et al. (1987) Plant Mol. Biol. 9:335-342;
Matton et al. (1989) Molecular Plant-Microbe Interactions 2:325-331; Somsisch
et al. (1986) Proc. Natl. Acad. Sci. USA 83:2427-2430; Somsisch et al. (1988) Mol.
Gen. Genet. 2:93-98; and Yang (1996) Proc. Natl. Acad. Sci. USA 93:14972-14977.
See also, Chen et al. (1996) Plant J. 10:955-966; Zhang et al. (1994) Proc. Natl.
Acad. Sci. USA 91:2507-2511; Warner et al. (1993) Plant J. 3:191-201; Siebertz et al.
(1989) Plant Cell 1:961-968; U.S. Patent No. 5,750,386 (nematode-inducible); and
the references cited therein. Of particular interest is the inducible promoter for the
maize PRms gene, whose expression is induced by the pathogen Fusarium
moniliforme (see, for example, Cordero et al. (1992) Physiol. Mol. Plant Path.
41:189-200).

Additionally, as pathogens find entry into plants through wounds or insect
damage, a wound-inducible promoter may be used in the constructions of the
invention. Such wound-inducible promoters include potato proteinase inhibitor (pin
II) gene (Ryan (1990) Ann. Rev. Phytopath. 28:425-449; Duan et al. (1996) Nature
Biotechnology 14:494-498); wun1 and wun2, U.S. Patent No. 5,428,148; win1 and
win2 (Stanford et al. (1989) Mol. Gen. Genet. 215:200-208); systemin (McGurl et al.
(1992) Science 225:1570-1573); WIP1 (Rohmeier et al. (1993) Plant Mol. Biol.
22:783-792; Eckelkamp et al. (1993) FEBS Letters 323:73-76); MPI gene (Corderok
et al. (1994) Plant J. 6(2):141-150); and the like, herein incorporated by reference.

Chemical-regulated promoters can be used to modulate the expression of a
gene in a plant through the application of an exogenous chemical regulator.
Depending upon the objective, the promoter may be a chemical-inducible promoter,
where application of the chemical induces gene expression, or a chemical-repressible
promoter, where application of the chemical represses gene expression.
Chemical-inducible promoters are known in the art and include, but are not limited to, the maize
In2-2 promoter, which is activated by benzenesulfonamide herbicide safeners, the
maize GST promoter, which is activated by hydrophobic electrophilic compounds that
are used as pre-emergent herbicides, and the tobacco PR-1a promoter, which is
activated by salicylic acid. Other chemical-regulated promoters of interest include
steroid-responsive promoters (see, for example, the glucocorticoid-inducible promoter
in Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88:10421-10425 and McNellis et
al. (1998) Plant J. 14(2):247-257) and tetracycline-inducible and
tetracycline-repressible promoters (see, for example, Gatz et al. (1991) Mol. Gen. Genet. 227:229-237,
and U.S. Patent Nos. 5,814,618 and 5,789,156), herein incorporated by
reference.

Tissue-preferred promoters can be utilized to target enhanced antimicrobial 
polypeptide expression within a particular plant tissue. See, for example, Yamamoto 
30 et al. (1997) Plant J. 12(2):255-265; Kawamata et al. (1997) Plant Cell Physiol. 

38(7):792-803; Hansen et al. (1997) Mol. Gen Genet. 254(3):337-343; Russell et al. 
(1997) Transgenic Res. 6(2):157-168; Rinehart et al. (1996) Plant Physiol. 
1 12(3):1331-1341; Van Camp et al. (1996) Plant Physiol. 1 12(2):525-535; 
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Canevascini et al. (1996) Plant Physiol. 1 12(2):5 13-524; Yamamoto et al. (1994) 
Plant Cell Physiol. 35(5):773-778; Lam (1994) Results Probl. Cell Differ. 20:181- 
1 96; Orozco et al. (1993) Plant Mol Biol. 23(6): 1 129-1 138; Matsuoka et al. (1993) 
Proc. Natl. Acad. Sci. USA 90(20):9586-9590; and Guevara-Garcia et al. (1993) Plant 
5 J. 4(3):495-505. Such promoters can be modified, if necessary, for weak expression. 
The method of transformation/transfection is not critical to the instant
invention; various methods of transformation or transfection are currently available.
Thus, any method that provides for effective transformation/transfection may be
employed. Transformation protocols, as well as protocols for introducing nucleotide
sequences into plants, may vary depending on the type of plant or plant cell, i.e.,
monocot or dicot, targeted for transformation. Suitable methods of introducing
nucleotide sequences into plant cells and subsequent insertion into the plant genome
include microinjection (Crossway et al. (1986) Biotechniques 4:320-334),
electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602-5606),
Agrobacterium-mediated transformation (Townsend et al., U.S. Patent No. 5,563,055;
Zhao et al., U.S. Patent No. 5,981,840), direct gene transfer (Paszkowski et al. (1984)
EMBO J. 3:2717-2722), and ballistic particle acceleration (see, for example, Sanford
et al., U.S. Patent No. 4,945,050; Tomes et al., U.S. Patent No. 5,879,918; Tomes et
al., U.S. Patent No. 5,886,244; Bidney et al., U.S. Patent No. 5,932,782; Tomes et al.
(1995) "Direct DNA Transfer into Intact Plant Cells via Microprojectile
Bombardment," in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed.
Gamborg and Phillips (Springer-Verlag, Berlin); McCabe et al. (1988) Biotechnology
6:923-926); and Lec1 transformation (WO 00/28058). Also see Weissinger et al.
(1988) Ann. Rev. Genet. 22:421-477; Sanford et al. (1987) Particulate Science and
Technology 5:27-37 (onion); Christou et al. (1988) Plant Physiol. 87:671-674
(soybean); McCabe et al. (1988) Bio/Technology 6:923-926 (soybean); Finer and
McMullen (1991) In Vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh et al. (1998)
Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology
8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309
(maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); Tomes, U.S. Patent
No. 5,240,855; Buising et al., U.S. Patent Nos. 5,322,783 and 5,324,646; Tomes et al.
(1995) "Direct DNA Transfer into Intact Plant Cells via Microprojectile
Bombardment," in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, ed.
Gamborg (Springer-Verlag, Berlin) (maize); Klein et al. (1988) Plant Physiol.
91:440-444 (maize); Fromm et al. (1990) Biotechnology 8:833-839 (maize);
Hooykaas-Van Slogteren et al. (1984) Nature (London) 311:763-764; Bowen et al.,
U.S. Patent No. 5,736,369 (cereals); Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA
84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation of
Ovule Tissues, ed. Chapman et al. (Longman, New York), pp. 197-209 (pollen);
Kaeppler et al. (1990) Plant Cell Reports 9:415-418 and Kaeppler et al. (1992) Theor.
Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin et al. (1992)
Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports
12:250-255 and Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda
et al. (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens);
all of which are herein incorporated by reference.

The cells that have been transformed may be grown into plants in accordance
with conventional methods. See, for example, McCormick et al. (1986) Plant Cell
Reports 5:81-84. These plants may then be grown, and either pollinated with the
same transformed strain or different strains, and the resulting hybrid having
constitutive expression of the desired phenotypic characteristic identified. Two or
more generations may be grown to ensure that constitutive expression of the desired
phenotypic characteristic is stably maintained and inherited, and then seeds harvested
to ensure constitutive expression of the desired phenotypic characteristic has been
achieved.

The present invention may be used for transformation of any plant species,
including, but not limited to, monocots and dicots. Examples of plants of interest
include, but are not limited to, rice (Oryza sativa), corn (Zea mays), Brassica sp. (e.g.,
B. napus, B. rapa, B. juncea), particularly those Brassica species useful as sources of
seed oil, alfalfa (Medicago sativa), rye (Secale cereale), sorghum (Sorghum bicolor,
Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet
(Panicum miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine
coracana)), sunflower (Helianthus annuus), safflower (Carthamus tinctorius), wheat
(Triticum aestivum), soybean (Glycine max), tobacco (Nicotiana tabacum), potato
(Solanum tuberosum), peanuts (Arachis hypogaea), cotton (Gossypium barbadense,
Gossypium hirsutum), sweet potato (Ipomoea batatus), cassava (Manihot esculenta),
coffee (Coffea spp.), coconut (Cocos nucifera), pineapple (Ananas comosus), citrus
trees (Citrus spp.), cocoa (Theobroma cacao), tea (Camellia sinensis), banana (Musa
spp.), avocado (Persea americana), fig (Ficus casica), guava (Psidium guajava),
mango (Mangifera indica), olive (Olea europaea), papaya (Carica papaya), cashew
(Anacardium occidentale), macadamia (Macadamia integrifolia), almond (Prunus
amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats, barley,
vegetables, ornamentals, and conifers.

Vegetables include tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca
sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas
(Lathyrus spp.), and members of the genus Cucumis such as cucumber (C. sativus),
cantaloupe (C. cantalupensis), and muskmelon (C. melo). Ornamentals include azalea
(Rhododendron spp.), hydrangea (Macrophylla hydrangea), hibiscus (Hibiscus
rosasanensis), roses (Rosa spp.), tulips (Tulipa spp.), daffodils (Narcissus spp.), petunias
(Petunia hybrida), carnation (Dianthus caryophyllus), poinsettia (Euphorbia
pulcherrima), and chrysanthemum. Conifers that may be employed in practicing the
present invention include, for example, pines such as loblolly pine (Pinus taeda), slash
pine (Pinus elliotii), ponderosa pine (Pinus ponderosa), lodgepole pine (Pinus contorta),
and Monterey pine (Pinus radiata); Douglas-fir (Pseudotsuga menziesii); Western
hemlock (Tsuga canadensis); Sitka spruce (Picea glauca); redwood (Sequoia
sempervirens); true firs such as silver fir (Abies amabilis) and balsam fir (Abies
balsamea); and cedars such as Western red cedar (Thuja plicata) and Alaska
yellow-cedar (Chamaecyparis nootkatensis). Preferably, plants of the present invention
are crop plants (for example, rice, corn, alfalfa, sunflower, Brassica, soybean, cotton,
safflower, peanut, sorghum, wheat, millet, tobacco, etc.).

Prokaryotic cells may be used as hosts for expression. Prokaryotes most
frequently are represented by various strains of E. coli; however, other microbial
strains may also be used. Commonly used prokaryotic control sequences, which are
defined herein to include promoters for transcription initiation, optionally with an
operator, along with ribosome binding sequences, include such commonly used
promoters as the beta lactamase (penicillinase) and lactose (lac) promoter systems
(Chang et al. (1977) Nature 198:1056), the tryptophan (trp) promoter system
(Goeddel et al. (1980) Nucleic Acids Res. 8:4057) and the lambda derived PL
promoter and N-gene ribosome binding site (Shimatake et al. (1981) Nature 292:128).
The inclusion of selection markers in DNA vectors transfected in E. coli is also
useful. Examples of such markers include genes specifying resistance to ampicillin,
tetracycline, or chloramphenicol.

The vector is selected to allow introduction into the appropriate host cell.
Bacterial vectors are typically of plasmid or phage origin. Appropriate bacterial cells
are infected with phage vector particles or transfected with naked phage vector DNA.
If a plasmid vector is used, the bacterial cells are transfected with the plasmid vector
DNA. Expression systems for expressing a polypeptide of the present invention are
available using Bacillus sp. and Salmonella (Palva et al. (1983) Gene 22:229-235;
Mosbach et al. (1983) Nature 302:543-545).

A variety of eukaryotic expression systems, such as yeast, insect cell lines,
plant and mammalian cells, are known to those of skill in the art. As explained briefly
below, a polynucleotide of the present invention can be expressed in these eukaryotic
systems. In some embodiments, transformed/transfected plant cells, as discussed
infra, are employed as expression systems for production of the polypeptides of the
instant invention.

The sequences of the present invention can also be ligated to various
expression vectors for use in transfecting cell cultures of, for instance, mammalian,
insect, or plant origin. Illustrative cell cultures useful for the production of the
peptides are mammalian cells. A number of suitable host cell lines capable of
expressing intact proteins have been developed in the art, and include the HEK293,
BHK21, and CHO cell lines. Expression vectors for these cells can include
expression control sequences, such as an origin of replication, a promoter (e.g., the
CMV promoter, a HSV tk promoter or pgk (phosphoglycerate kinase) promoter), an
enhancer (Queen et al. (1986) Immunol. Rev. 89:49), and necessary processing
information sites, such as ribosome binding sites, RNA splice sites, polyadenylation
sites (e.g., an SV40 large T Ag poly A addition site), and transcriptional terminator
sequences. Other animal cells useful for production of polypeptides of the present
invention are available, for instance, from the American Type Culture Collection.

Appropriate vectors for expressing polypeptides of the present invention in
insect cells are usually derived from the SF9 baculovirus. Suitable insect cell lines
include mosquito larvae, silkworm, armyworm, moth and Drosophila cell lines such
as a Schneider cell line (See, Schneider (1987) J. Embryol. Exp. Morphol.
27:353-365).

As with yeast, when higher animal or plant host cells are employed,
polyadenylation or transcription terminator sequences are typically incorporated into
the vector. An example of a terminator sequence is the polyadenylation sequence
from the bovine growth hormone gene. Sequences for accurate splicing of the
transcript may also be included. An example of a splicing sequence is the VP1 intron
from SV40 (Sprague et al. (1983) J. Virol. 45:773-781). Additionally, gene sequences
to control replication in the host cell may be incorporated into the vector such as those
found in bovine papilloma virus type-vectors (Saveria-Campo (1985) DNA Cloning
Vol. II, a Practical Approach, D.M. Glover, Ed., IRL Press, Arlington, Virginia, pp.
213-238).

Animal and lower eukaryotic (e.g., yeast) host cells are competent or rendered
competent for transfection by various means. There are several well-known methods
of introducing DNA into animal cells. These include: calcium phosphate
precipitation, fusion of the recipient cells with bacterial protoplasts containing the
DNA, treatment of the recipient cells with liposomes containing the DNA, DEAE
dextrin, electroporation, biolistics, and micro-injection of the DNA directly into the
cells. The transfected cells are cultured by means well known in the art (Kuchler
(1997) Biochemical Methods in Cell Culture and Virology, Dowden, Hutchinson and
Ross, Inc.).

Synthesis of heterologous nucleotide sequences in yeast is well known
(Sherman et al. (1982) Methods in Yeast Genetics, Cold Spring Harbor Laboratory).
Two widely utilized yeasts for production of eukaryotic proteins are Saccharomyces
cerevisiae and Pichia pastoris. Vectors, strains, and protocols for expression in
Saccharomyces and Pichia are known in the art and available from commercial
suppliers (e.g., Invitrogen). Suitable vectors usually have expression control
sequences, such as promoters, including 3-phosphoglycerate kinase or alcohol
oxidase, and an origin of replication, termination sequences and the like as desired.

A polypeptide of the present invention, once expressed, can be isolated from
yeast by lysing the cells and applying standard protein isolation techniques to the
lysates. The monitoring of the purification process can be accomplished by using
Western blot techniques, UV absorption spectra, radioimmunoassay, or other standard
immunoassay techniques.

The invention is drawn to a general method for identifying and making
antimicrobial compositions, particularly antifungal compositions. The methods
involve injection of an insect with a suspension of a plant pathogenic fungus to induce
insect polypeptides possessing antimicrobial activity. Such polypeptides are isolated
from the insect hemolymph using a combination of high-resolution liquid
chromatography and mass spectrometry.

The general strategy for the discovery of these insect-derived antimicrobial
peptides involves challenging insects with a selected plant pathogen and collecting
hemolymph and fat body samples at various times post-induction. For example,
hemolymph and fat body samples can be collected at about 8 hour, 16 hour, 24 hour,
or 48 hour intervals. It is recognized that any method for protein separation and
identification may be used to isolate peptides and the corresponding nucleic acid
sequences.

While not bound by any particular method, identification of antimicrobial
peptides active against the target pathogen may be achieved using an integrated
proteomic, genomic, and miniaturized bioassay approach. This approach begins with
separation of hemolymph isolated from induced insects. Any method of separation
can be used, including HPLC separation. The eluate from HPLC-aided separation may
be collected as 30-second fractions in a microtiter plate format, i.e., a 96-well
microtiter plate. Fractions collected in this manner are dried down and directly used
in a fungal growth assay (FGA) in which the dried fractions are resuspended in 100 µl
of half strength potato dextrose broth containing a suspension of the target fungal
pathogen. Fractions that contain antimicrobial peptides are identified in the FGA by
their ability to inhibit fungal growth after several hours, generally 24 to 48 hours.
These fractions are subjected to further purification in order to isolate individual
peptides, and the specific peptide responsible for the observed activity is determined
by FGA. This peptide is subsequently N-terminally sequenced and its molecular
weight determined by mass spectrometry to provide information to identify the
corresponding gene from sequence data derived from the corresponding insect cDNA
libraries. The complete amino acid sequence of the peptide is determined by
translation of the nucleic acid sequence, and the mature peptide is identified based on
both N-terminal sequence and molecular weight information.
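
By way of illustration only, the final step described above (locating the mature
peptide within a translated precursor from an N-terminal sequence tag and a measured
molecular weight) can be sketched as follows. This is a minimal sketch and not the
procedure actually used: the function names, the example precursor, and the mass
tolerance are hypothetical, and the calculation assumes a linear, unmodified peptide
with monoisotopic residue masses. Disulfide bridges, which each lower the mass by
about 2 Da, and other modifications are not accounted for.

```python
# Monoisotopic residue masses (Da) for the 20 standard amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # mass of H2O added for the free N- and C-termini


def peptide_mass(seq):
    """Monoisotopic mass of a linear, unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def find_mature_peptides(precursor, n_term_tag, observed_mass, tol=0.5):
    """Return candidate mature peptides found in a translated precursor.

    A candidate starts where the N-terminal tag occurs and ends at the
    residue where the running mass matches observed_mass within tol (Da).
    """
    hits = []
    start = precursor.find(n_term_tag)
    while start != -1:
        mass = WATER
        for end in range(start, len(precursor)):
            mass += RESIDUE_MASS[precursor[end]]
            long_enough = end + 1 - start >= len(n_term_tag)
            if long_enough and abs(mass - observed_mass) <= tol:
                hits.append(precursor[start:end + 1])
            if mass > observed_mass + tol:
                break  # extending further only increases the mass
        start = precursor.find(n_term_tag, start + 1)
    return hits


# Hypothetical precursor and tag, for demonstration only.
example_precursor = "MKLLVVAASLGSCKRWFDEAGK"
example_mass = peptide_mass("GSCKRWF")
print(find_mature_peptides(example_precursor, "GSCK", example_mass, tol=0.01))
```

Requiring both the sequence tag and the mass to agree narrows the candidates to a
single start and end position in most cases; a wider tolerance would be needed for
average-mass measurements.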



The defensive agents of the invention encompass the mature active peptides
as well as unprocessed or prepro-forms of the peptides. Where a mature peptide has
been isolated, the prepro sequence, or signal sequence, can be obtained by a number
of general molecular biology techniques known in the art.

As indicated, the defensive agents may be isolated from any insect of interest.
Of particular interest are insects living in harsh environments and insects that are
natural plant predators. While any insect may be utilized, it may be beneficial to use
insect predators of a particular plant of interest. For example, to obtain defensive
agents for use in maize, while any insect may be used, maize insect predators may be
beneficial.

Although a defensive agent may be induced by a particular pathogen, it is 
anticipated that the defensive agent may be effective against one or more additional 
pathogens, including but not limited to, any of the pathogens listed above. 

The polypeptides are tested for antimicrobial activity using in vitro assays as
described elsewhere herein. Isolated antimicrobial polypeptides are subjected to
proteolysis, and the amino termini of the resulting proteolytic fragments are
sequenced. Degenerate oligonucleotides encoding the amino terminal sequence tags
are used to identify the antimicrobial polypeptide-encoding cDNAs from
corresponding pathogen induced insect cDNA libraries. The nucleic acid molecules
encoding the antimicrobial polypeptides are used for the transformation of plant cells
to generate plants with enhanced disease resistance. Additionally, the compositions
of the invention can be used to generate formulations possessing disease resistance
activities.
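
For illustration, the design of a degenerate oligonucleotide from an amino terminal
sequence tag can be sketched as below. This is a minimal sketch under stated
assumptions, not the design procedure of the invention: it uses the standard genetic
code, collapses all codons for each residue into one IUPAC ambiguity code per
position, and the function names are hypothetical. For residues with six codons
(Leu, Ser, Arg) the per-position consensus also matches codons for other amino
acids, so a practical design might instead split such positions into separate pools.

```python
from math import prod

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid to the list of codons that encode it.
CODONS = {}
for i, b1 in enumerate(BASES):
    for j, b2 in enumerate(BASES):
        for k, b3 in enumerate(BASES):
            aa = AMINO_ACIDS[16 * i + 4 * j + k]
            CODONS.setdefault(aa, []).append(b1 + b2 + b3)

# IUPAC nucleotide ambiguity codes, keyed by the sorted set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
    "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N",
}


def degenerate_codon(aa):
    """Collapse all codons for one amino acid into a 3-letter IUPAC codon."""
    codons = CODONS[aa]
    return "".join(
        IUPAC["".join(sorted({c[pos] for c in codons}))] for pos in range(3)
    )


def degenerate_oligo(peptide_tag):
    """Back-translate a peptide tag into a degenerate oligonucleotide."""
    return "".join(degenerate_codon(aa) for aa in peptide_tag)


def pool_size(oligo):
    """Number of distinct sequences represented by a degenerate oligo."""
    sizes = {code: len(bases) for bases, code in IUPAC.items()}
    return prod(sizes[ch] for ch in oligo)


tag = "MKW"  # hypothetical sequence tag, for demonstration only
oligo = degenerate_oligo(tag)
print(oligo, pool_size(oligo))
```

Tags rich in Met and Trp (one codon each) give the smallest pools, which is one
reason such residues are often favored when choosing the region to probe.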

Methods for increasing pathogen resistance in a plant are provided. The
methods involve stably transforming a plant with a DNA construct comprising a
nucleotide sequence of a defensive agent of the invention operably linked to a promoter
that drives expression in a plant. Such methods may find use in agriculture,
particularly in limiting the impact of plant fungal pathogens on crop plants. The
antimicrobial nucleotide sequences comprise the insect nucleic acid molecules of the
invention and functional variants and fragments thereof. The choice of promoter will
depend on the desired timing and location of expression of the antimicrobial
nucleotide sequences. Promoters of the invention include constitutive, inducible, and
tissue-preferred promoters.

As discussed above, the nucleotide sequences of the invention encode
polypeptides with antimicrobial properties, particularly fungicidal properties. Hence,
the sequences of the invention may enhance transgenic plant disease resistance by
disrupting cellular function of plant pathogens, particularly plant fungal pathogens.
However, it is recognized that the present invention is not dependent upon a particular
mechanism of defense. Rather, the compositions and methods of the invention work
to increase resistance of the plant to pathogens independent of how that resistance is
increased or achieved.

The methods of the invention can be used with other methods available in the
art for enhancing disease resistance in plants. Similarly, the antimicrobial
compositions described herein may be used alone or in combination with other
nucleotide sequences, polypeptides, or agents to protect against plant diseases and
pathogens. Although any one of a variety of second nucleotide sequences may be
utilized, specific embodiments of the invention encompass those second nucleotide
sequences that, when expressed in a plant, help to increase the resistance of a plant to
pathogens.

Proteins, peptides, and lysozymes that naturally occur in insects (Jaynes et al. 
(1987) Bioassays 6:263-270), plants (Broekaert et al. (1997) Critical Reviews in Plant 
Sciences 16:297-323), animals (Vunnam et al. (1997) J. Peptide Res. 49:59-66), and 
humans (Mitra and Zang (1994) Plant Physiol. 106:977-981; Nakajima et al. (1997)
Plant Cell Reports 16:674-679) are also a potential source of plant disease resistance 
(Ko, K. (2000) http://ww.scisoc.org/feature/m^ 

Examples of such plant resistance-conferring sequences include those encoding
sunflower rhoGTPase-Activating Protein (rhoGAP), lipoxygenase (LOX), Alcohol
Dehydrogenase (ADH), and Sclerotinia-Inducible Protein-1 (SCIP-1) described in US
application 09/714,767, herein incorporated by reference. These nucleotide sequences
enhance plant disease resistance through the modulation of development,
developmental pathways, and the plant pathogen defense system. Other plant defense
proteins include those described in WO 99/43823 and WO 99/43821, all of which are
herein incorporated by reference. It is recognized that such second nucleotide
sequences may be used in either the sense or antisense orientation depending on the
desired outcome.

In one embodiment of the invention, at least one expression cassette
comprising a nucleic acid molecule encoding the Mag1 polypeptide set forth in SEQ
ID NO:2 is stably incorporated into a rice plant host, to confer on the plant enhanced
disease resistance to fungal pathogens, particularly the pathogen M. grisea. While the
choice of promoter will depend on the desired timing and location of expression of the
Mag1 nucleotide sequence, preferred promoters include constitutive and
pathogen-inducible promoters. By "inducible" is intended the ability of the promoter
sequence to regulate expression of an operably linked nucleotide sequence in response
to a stimulus. In the case of a pathogen-inducible promoter, regulation of expression
will be in response to a pathogen-derived stimulus.

Another embodiment of the invention involves the stable incorporation of at
least one expression cassette comprising a nucleotide sequence encoding at least one
of the Rhizoc1 polypeptide set forth in SEQ ID NO:12, the Rhizoc2 polypeptide set
forth in SEQ ID NO:4, or the Rhizoc3 polypeptide set forth in SEQ ID NO:16 into a
rice plant host to confer on the plant enhanced disease resistance to fungal pathogens,
particularly the pathogen R. solani. While the choice of promoter will depend on the
desired timing and location of expression of the nucleotide sequence, preferred
promoters include constitutive and pathogen-inducible promoters.

An additional embodiment of the invention involves the stable incorporation
of at least one expression cassette comprising a nucleotide sequence encoding at least
one of the Rhizoc1 polypeptide set forth in SEQ ID NO:12 or the Fus1 polypeptide
set forth in SEQ ID NO:14 into a corn plant host to confer on the plant enhanced
disease resistance to fungal pathogens, particularly the pathogen F. verticilloides. In
an embodiment the nucleotide sequence is a codon-biased sequence, such as the
codon-biased sequence set forth in SEQ ID NO:122, 124, 126, or 128. While the
choice of promoter will depend on the desired timing and location of expression of the
nucleotide sequence, preferred promoters include constitutive and pathogen-inducible
promoters.
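
As a purely illustrative sketch of what a codon-biased sequence is, the code below
back-translates a peptide using one preferred codon per amino acid and confirms
that the result still encodes the same peptide. Everything here is an assumption
for demonstration: the function names are hypothetical, the small preference table
is a placeholder rather than measured codon usage for corn or any other host, and
the sequences of SEQ ID NO:122, 124, 126, and 128 are not reproduced or derived.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TO_AA = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding sequence (length divisible by 3) to protein."""
    return "".join(CODON_TO_AA[dna[p:p + 3]] for p in range(0, len(dna), 3))


def codon_bias(peptide, preferred):
    """Back-translate a peptide using one preferred codon per amino acid.

    Raises ValueError if the preference table does not encode the peptide.
    """
    dna = "".join(preferred[aa] for aa in peptide)
    if translate(dna) != peptide:
        raise ValueError("preferred-codon table does not encode the peptide")
    return dna


# Placeholder preferences for illustration only; NOT real host codon usage.
EXAMPLE_PREFERRED = {"M": "ATG", "K": "AAG", "A": "GCC", "G": "GGC", "C": "TGC"}

print(codon_bias("MKAGC", EXAMPLE_PREFERRED))
```

Because only synonymous codons are substituted, the encoded polypeptide is
unchanged; the round-trip check in codon_bias guards against an incorrect table.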

In an embodiment of the invention, the polypeptides of the invention can be
formulated with an acceptable carrier into an antimicrobial composition(s) that is, for
example, a suspension, a solution, an emulsion, a dusting powder, a dispersible
granule, a wettable powder, an emulsifiable concentrate, an aerosol, an
impregnated granule, an adjuvant, or a coatable paste, and also in encapsulations, for
example, polymer substances.

In another embodiment, the defensive agents comprise isolated polypeptides
of the invention. The defensive agents of the invention find use in the
decontamination of plant pathogens during the processing of grain for animal or
human food consumption, during the processing of feedstuffs, and during the
processing of plant material for silage. In this embodiment, the defensive agents of
the invention are presented to grain, plant material for silage, or a contaminated food
crop, or during an appropriate stage of the processing procedure, in amounts effective
for anti-microbial activity. The compositions can be applied to the environment of a
plant pathogen by, for example, spraying, atomizing, dusting, scattering, coating or
pouring, introducing into or on the soil, introducing into irrigation water, by seed
treatment, or dusting at the time when the plant pathogen has begun to appear or
before the appearance of pests as a protective measure. It is recognized that any
means to bring the defensive agent polypeptides in contact with the plant pathogen
can be used in the practice of the invention.

Methods are provided for controlling plant pathogens comprising applying a
decontaminating amount of a polypeptide or composition of the invention to the
environment of the plant pathogen. The polypeptides of the invention can be
formulated with an acceptable carrier into a composition(s) that is, for example, a
suspension, a solution, an emulsion, a dusting powder, a dispersible granule, a
wettable powder, an emulsifiable concentrate, an aerosol, an impregnated granule, an
adjuvant, a coatable paste, and also encapsulations in, for example, polymer
substances.

Such compositions disclosed above may be obtained by the addition of a
surface-active agent, an inert carrier, a preservative, a humectant, a feeding stimulant,
an attractant, an encapsulating agent, a binder, an emulsifier, a dye, a UV protectant, a
buffer, a flow agent or fertilizers, micronutrient donors or other preparations that
influence plant growth. One or more agrochemicals including, but not limited to,
herbicides, insecticides, fungicides, bacteriocides, nematocides, molluscicides,
acaracides, plant growth regulators, harvest aids, and fertilizers, can be combined with
carriers, surfactants, or adjuvants customarily employed in the art of formulation or
other components to facilitate product handling and application for particular target
mycotoxins. Suitable carriers and adjuvants can be solid or liquid and correspond to
the substances ordinarily employed in formulation technology, e.g., natural or
regenerated mineral substances, solvents, dispersants, wetting agents, tackifiers,
binders, or fertilizers. The active ingredients of the present invention are normally
applied in the form of compositions and can be applied to the crop area or plant to be
treated, simultaneously or in succession, with other compounds. In some
embodiments, methods of applying an active ingredient of the present invention or an
agrochemical composition of the present invention (which contains at least one of the
proteins of the present invention) are foliar application, seed coating, and soil
application.

Suitable surface-active agents include, but are not limited to, anionic
compounds such as a carboxylate of, for example, a metal; a carboxylate of a long
chain fatty acid; an N-acylsarcosinate; mono or di-esters of phosphoric acid with fatty
alcohol ethoxylates or salts of such esters; fatty alcohol sulfates such as sodium
dodecyl sulfate, sodium octadecyl sulfate, or sodium cetyl sulfate; ethoxylated fatty
alcohol sulfates; ethoxylated alkylphenol sulfates; lignin sulfonates; petroleum
sulfonates; alkyl aryl sulfonates such as alkyl-benzene sulfonates or lower
alkylnaphthalene sulfonates, e.g., butyl-naphthalene sulfonate; salts of sulfonated
naphthalene-formaldehyde condensates; salts of sulfonated phenol-formaldehyde
condensates; more complex sulfonates such as the amide sulfonates, e.g., the
sulfonated condensation product of oleic acid and N-methyl taurine; or the dialkyl
sulfosuccinates, e.g., the sodium sulfonate or dioctyl succinate. Non-ionic agents
include condensation products of fatty acid esters, fatty alcohols, fatty acid amides or
fatty-alkyl- or alkenyl-substituted phenols with ethylene oxide, fatty esters of
polyhydric alcohol ethers, e.g., sorbitan fatty acid esters, condensation products of
such esters with ethylene oxide, e.g., polyoxyethylene sorbitan fatty acid esters, block
copolymers of ethylene oxide and propylene oxide, acetylenic glycols such as
2,4,7,9-tetraethyl-5-decyn-4,7-diol, or ethoxylated acetylenic glycols. Examples of a
cationic surface-active agent include, for instance, an aliphatic mono-, di-, or
polyamine such as an acetate, naphthenate, or oleate; or oxygen-containing amine
such as an amine oxide of polyoxyethylene alkylamine; an amide-linked amine
prepared by the condensation of a carboxylic acid with a di- or polyamine; or a
quaternary ammonium salt.

Examples of inert materials include, but are not limited to, inorganic minerals
such as kaolin, phyllosilicates, carbonates, sulfates, phosphates, or botanical materials
such as cork, powdered corncobs, peanut hulls, rice hulls, and walnut shells.

The compositions of the present invention can be in a suitable form for direct
application or as a concentrate of a primary composition, which requires dilution with
a suitable quantity of water or other diluent before application. The decontaminating
concentration will vary depending upon the nature of the particular formulation,
specifically, whether it is a concentrate or to be used directly.
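
As a worked illustration of diluting a concentrate before application, the sketch
below applies the usual mass balance C1 x V1 = C2 x V2. The function name and the
example figures are hypothetical; no concentrations are specified in this
disclosure, and the calculation assumes that volumes are additive.

```python
def dilution_volumes(stock_conc, target_conc, final_volume):
    """Return (concentrate volume, diluent volume) for a target dilution.

    Uses C1 * V1 = C2 * V2; concentrations share one unit and volumes
    share another. Assumes volumes are additive on mixing.
    """
    if not 0 < target_conc <= stock_conc:
        raise ValueError("target must be positive and not exceed the stock")
    stock_volume = target_conc * final_volume / stock_conc
    return stock_volume, final_volume - stock_volume


# Hypothetical figures: a 100-unit concentrate diluted to 5 units in 20 liters.
concentrate, diluent = dilution_volumes(100.0, 5.0, 20.0)
print(concentrate, diluent)
```

A formulation intended for direct use corresponds to the case in which the target
concentration equals the stock concentration, so no diluent is added.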

In a further embodiment, the compositions, as well as the polypeptides of the
present invention, can be treated prior to formulation to prolong the activity when
applied to the environment of a plant pathogen as long as the pretreatment is not
deleterious to the activity. Such treatment can be by chemical and/or physical means
as long as the treatment does not deleteriously affect the properties of the
composition(s). Examples of chemical reagents include, but are not limited to,
halogenating agents; aldehydes such as formaldehyde and glutaraldehyde;
anti-infectives, such as zephiran chloride; alcohols, such as isopropanol and ethanol;
and histological fixatives, such as Bouin's fixative and Helly's fixative (see, for
example, Humason (1961) Animal Tissue Techniques (W.H. Freeman and Co.)).

In an embodiment of the invention, the compositions of the invention
comprise a microbe having stably integrated the nucleotide sequence of a defensive
agent. The resulting microbes can be processed and used as a microbial spray. Any
suitable microorganism can be used for this purpose. See, for example, Gaertner et al.
(1993) in Advanced Engineered Pesticides, Kim (Ed.). In one embodiment, the
nucleotide sequences of the invention are introduced into microorganisms that
multiply on plants (epiphytes) to deliver the defensive agents to potential target crops.
Epiphytes can be, for example, gram-positive or gram-negative bacteria.

It is further recognized that whole, i.e., unlysed, cells of the transformed
microorganism can be treated with reagents that prolong the activity of the
polypeptide produced in the microorganism when the microorganism is applied to the
environment of a target plant. A secretion signal sequence may be used in
combination with the gene of interest such that the resulting enzyme is secreted
outside the microorganism for presentation to the target plant.

In this manner, a gene encoding a defensive agent of the invention may be
introduced via a suitable vector into a microbial host, and said transformed host
applied to the environment, plants, or animals. Microorganism hosts that are known
to occupy the "phytosphere" (phylloplane, phyllosphere, rhizosphere, and/or
rhizoplane) of one or more crops of interest may be selected for transformation.
These microorganisms are selected so as to be capable of successfully competing in
the particular environment with the wild-type microorganisms, to provide for stable
maintenance and expression of the gene expressing the detoxifying polypeptide, and
for improved protection of the enzymes of the invention from environmental
degradation and inactivation.

Such microorganisms include bacteria, algae, and fungi. Of particular interest
are microorganisms, such as bacteria, e.g., Pseudomonas, Erwinia, Serratia,
Klebsiella, Xanthomonas, Streptomyces, Rhizobium, Rhodopseudomonas, Methylius,
Agrobacterium, Acetobacter, Lactobacillus, Arthrobacter, Azotobacter, Leuconostoc,
and Alcaligenes; fungi, particularly yeast, e.g., Saccharomyces, Pichia, Cryptococcus,
Kluyveromyces, Sporobolomyces, Rhodotorula, Aureobasidium, and Gliocladium. Of
particular interest are such phytosphere bacterial species as Pseudomonas syringae,
Pseudomonas fluorescens, Serratia marcescens, Acetobacter xylinum, Agrobacteria,
Rhodopseudomonas spheroides, Xanthomonas campestris, Rhizobium melioti,
Alcaligenes entrophus, Clavibacter xyli, and Azotobacter vinlandii; and phytosphere
yeast species such as Rhodotorula rubra, R. glutinis, R. marina, R. aurantiaca,
Cryptococcus albidus, C. diffluens, C. laurentii, Saccharomyces rosei, S. pretoriensis,
S. cerevisiae, Sporobolomyces rosues, S. odorus, Kluyveromyces veronae, and
Aureobasidium pullulans.

Illustrative prokaryotes, both Gram-negative and -positive, include 
Enterobacteriaceae, such as Escherichia, Erwinia, Shigella, Salmonella, and Proteus; 
Bacillaceae; Rhizobiaceae, such as Rhizobium; Spirillaceae, such as Photobacterium, 
Zymomonas, Serratia, Aeromonas, Vibrio, Desulfovibrio, Spirillum; 
Lactobacillaceae; Pseudomonadaceae, such as Pseudomonas and Acetobacter; 
Azotobacteraceae; and Nitrobacteraceae. Among eukaryotes are fungi, such as 
Phycomycetes and Ascomycetes, which includes yeast, such as Saccharomyces and 
Schizosaccharomyces; and Basidiomycetes yeast, such as Rhodotorula, 
Aureobasidium, Sporobolomyces, and the like. 

In an embodiment of the invention, the defensive agents of the invention may 
be used as a pharmaceutical compound for treatment of fungal and microbial 
pathogens in humans and other animals. Diseases and disorders caused by fungal and 
microbial pathogens include but are not limited to fungal meningoencephalitis, 
superficial fungal infections, ringworm, Athlete's foot, histoplasmosis, candidiasis, 
thrush, coccidioidoma, pulmonary cryptococcus, trichosporonosis, piedra, tinea nigra, 
fungal keratitis, onychomycosis, tinea capitis, chromomycosis, aspergillosis, 
endobronchial pulmonary aspergillosis, mucormycosis, chromoblastomycosis, 
dermatophytosis, tinea, fusariosis, pityriasis, mycetoma, pseudallescheriasis, and 
sporotrichosis. 

The compositions of the invention may be used as pharmaceutical compounds 
to provide treatment for diseases and disorders associated with, but not limited to, the 
following fungal pathogens: Histoplasma capsulatum, Candida spp. (C. albicans, C. 
tropicalis, C. parapsilosis, C. guilliermondii, C. glabrata/Torulopsis glabrata, C. 
krusei, C. lusitaniae), Aspergillus fumigatus, A. flavus, A. niger, Rhizopus spp., 
Rhizomucor spp., Cunninghamella spp., Apophysomyces spp., Saksenaea spp., Mucor 
spp., and Absidia spp. Efficacy of the compositions of the invention as anti-fungal 
treatments may be determined through anti-fungal assays known to one of skill in the 
art. 

The defensive agents may be administered to a patient through numerous 
means. Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 
salts, and fusidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. The compounds can also be prepared in the form of suppositories 
(e.g., with conventional suppository bases such as cocoa butter and other glycerides) 
or retention enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and 
Nova Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to 
infected cells with monoclonal antibodies to viral antigens) can also be used as 
pharmaceutically acceptable carriers. These can be prepared according to methods 
known to those skilled in the art, for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units suited as unitary dosages for the 
subject to be treated with each unit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic effect in association with the 
required pharmaceutical carrier. Depending on the type and severity of the disease, 
about 1 ng/kg to about 15 mg/kg (e.g., 0.1 to 20 mg/kg) of antibody is an initial 
candidate dosage for administration to the patient, whether, for example, by one or 
more separate administrations, or by continuous infusion. A typical daily dosage 
might range from about 1 µg/kg to about 100 mg/kg or more, depending on the 
factors mentioned above. For repeated administrations over several days or longer, 
depending on the condition, the treatment is sustained until a desired suppression of 
disease symptoms occurs. However, other dosage regimens may be useful. The 
progress of this therapy is easily monitored by conventional techniques and assays. 
An exemplary dosing regimen is disclosed in WO 94/04188. The specification for the 
dosage unit forms of the invention is dictated by and directly dependent on the 
unique characteristics of the active compound and the particular therapeutic effect to 
be achieved, and the limitations inherent in the art of compounding such an active 
compound for the treatment of individuals. 

"Treatment" is herein defined as the application or administration of a 
therapeutic agent to a patient, or application or administration of a therapeutic agent to 
an isolated tissue or cell line from a patient, who has a disease, a symptom of disease 
or a predisposition toward a disease, with the purpose to cure, heal, alleviate, relieve, 
alter, remedy, ameliorate, improve or affect the disease, the symptoms of disease or 
the predisposition toward disease. A "therapeutic agent" includes, but is not limited 
to, small molecules, peptides, antibodies, ribozymes and antisense oligonucleotides. 

The defensive agents of the invention can be used for any application 
including coating surfaces to target microbes. In this manner, target microbes include 
human pathogens or microorganisms. Surfaces that might be coated with the 
defensive agents of the invention include carpets and sterile medical facilities. 
Polymer bound polypeptides of the invention may be used to coat surfaces. Methods 
for incorporating compositions with anti-microbial properties into polymers are 
known in the art. See U.S. Patent No. 5,847,047, herein incorporated by reference. 

An isolated polypeptide of the invention can be used as an immunogen to 
generate antibodies that bind defensive agents using standard techniques for 
polyclonal and monoclonal antibody preparation. The full-length defensive agents 
can be used or, alternatively, the invention provides antigenic peptide fragments of 
defensive agents for use as immunogens. The antigenic peptide of a defensive agent 
comprises at least 8, preferably 10, 15, 20, or 30 amino acid residues of the amino 
acid sequence shown in SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 
31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 
65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 96, 97, 
98, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127, and 
encompasses an epitope of a defensive agent such that an antibody raised against the 
peptide forms a specific immune complex with the anti-microbial polypeptides. 
Preferred epitopes encompassed by the antigenic peptide are regions of defensive 
agents that are located on the surface of the protein, e.g., hydrophilic regions. 

Accordingly, another aspect of the invention pertains to anti-defensive agent 
polyclonal and monoclonal antibodies that bind a defensive agent. Polyclonal 
defensive agent-like antibodies can be prepared by immunizing a suitable subject 
(e.g., rabbit, goat, mouse, or other mammal) with a defensive agent-like immunogen. 
The anti-defensive agent antibody titer in the immunized subject can be monitored 
over time by standard techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized anti-microbial polypeptides. At an appropriate 
time after immunization, e.g., when the anti-defensive agent antibody titers are 
highest, antibody-producing cells can be obtained from the subject and used to 
prepare monoclonal antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein (1975) Nature 256:495-497, 
the human B cell hybridoma technique (Kozbor et al. (1983) Immunol. Today 4:72), 
the EBV-hybridoma technique (Cole et al. (1985) in Monoclonal Antibodies and 
Cancer Therapy, ed. Reisfeld and Sell (Alan R. Liss, Inc., New York, NY), pp. 77-96) 
or trioma techniques. The technology for producing hybridomas is well known (see 
generally Coligan et al., eds. (1994) Current Protocols in Immunology (John Wiley & 
Sons, Inc., New York, NY); Galfre et al. (1977) Nature 266:550-52; Kenneth (1980) in 
Monoclonal Antibodies: A New Dimension In Biological Analyses (Plenum 
Publishing Corp., NY); and Lerner (1981) Yale J. Biol. Med. 54:387-402). 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti-defensive agent-like antibody can be identified and isolated by 
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with a defensive agent to thereby isolate immunoglobulin 
library members that bind the defensive agent. Kits for generating and screening 
phage display libraries are commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ 
Phage Display Kit, Catalog No. 240612). Additionally, examples of methods and 
reagents particularly amenable for use in generating and screening antibody display 
library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication 
Nos. WO 92/18619; WO 91/17271; WO 92/20791; WO 92/15679; 93/01288; WO 
92/01047; 92/09690; and 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; 
Hay et al. (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 
246:1275-1281; Griffiths et al. (1993) EMBO J. 12:725-734. The antibodies can be 
used to identify homologs of the defensive agents of the invention. 

The following examples are offered by way of illustration and not by way of 
limitation. 

EXPERIMENTAL 

Example 1. Bioassay for Fungicidal Activity of Manduca sexta Hemolymph 
Polypeptides 

After resolution by liquid chromatography (LC), the various pathogen induced 
M. sexta polypeptide-containing fractions were assayed for fungicidal activity against 
the plant pathogens M. grisea, R. solani, and F. verticilloides. The LC fractions were 
first lyophilized in 96-well microtitre plates. A suspension of 100 µl of M. grisea (or 
other named pathogen), at the standard fungal growth assay concentration (2500 
spores/ml), was added to the polypeptide-containing microtitre plate wells, and the 
plates were sealed with Borden® Sealwrap™. The plates were then placed at 28°C in a 
dark chamber for 24 hours. Hyphal growth was monitored using a dissecting 
microscope. The polypeptides contained in the wells that lacked hyphal growth, or 
that displayed reduced hyphal growth compared to control wells, were considered to 
possess fungicidal activity. Hyphal growth was scored again, 48 hours post 
inoculation, for a final determination of fungicidal activity. 
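
The inoculum and scoring logic of this bioassay can be sketched as follows. This is only an illustrative sketch: the 0/1/2 growth scale and the function names are assumptions introduced here and are not part of the described protocol, which relies on visual inspection under a dissecting microscope.

```python
# Illustrative sketch of the Example 1 bioassay arithmetic and scoring.
# The numeric growth scale below is a hypothetical convention, not from the text.

SPORES_PER_ML = 2500   # standard fungal growth assay concentration (from the text)
WELL_VOLUME_UL = 100   # suspension volume added per well (from the text)


def spores_per_well(conc_per_ml=SPORES_PER_ML, volume_ul=WELL_VOLUME_UL):
    """Number of spores delivered to one microtitre well."""
    return conc_per_ml * volume_ul / 1000.0


# Hypothetical scale: 0 = no hyphal growth, 1 = reduced growth, 2 = control-like growth.
def is_fungicidal(score_24h, score_48h, control_score=2):
    """Final call uses the 48 h score; the 24 h score is kept only as a record."""
    return score_48h < control_score


if __name__ == "__main__":
    print(spores_per_well())      # spores per well at the stated concentration
    print(is_fungicidal(0, 1))    # growth still reduced at 48 h
    print(is_fungicidal(1, 2))    # growth recovered to control level by 48 h
```

Under these assumptions each well receives 250 spores, and a fraction is called active only if growth is still absent or reduced relative to the control at the 48 hour reading.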


Example 2. Induction of Antimicrobial Response in Manduca sexta 
Fifth instar M. sexta larvae were injected intersegmentally with 20 µl of a 
highly concentrated suspension of M. grisea hyphae and spores previously scraped 
from an agar plate colony. The larvae were then placed on fresh diet and allowed to 
recover. After 24, 48, and 72 hours, hemolymph was collected from the larvae by 
clipping off a proleg using fine surgical scissors over a sheet of Parafilm™. 
Approximately 1 ml/insect can be collected in this way. The hemolymph was 
transferred to a 50 ml conical flask and placed on ice while the remaining larvae were 
being processed. Once all larvae had been processed, phenylthiourea was added to 
a final concentration of 20 mM. Aprotinin was also added to the sample (final 
concentration 20 µg/ml). The samples were centrifuged (3000 rpm) for 5 minutes to 
pellet cells. The remaining supernatant (hemolymph) was subjected to solid-phase 
extraction using Supelco Discovery® DSC-18 solid-phase extraction columns. The 
columns are preconditioned using 100% methanol and equilibrated using 100% Solvent 
A (5% acetonitrile, 0.1% TFA; 1 column volume) before the sample is loaded. After 
the hemolymph (supernatant) filters through, the column is washed with Solvent A 
before eluting with one column volume of 60% Solvent B/40% Solvent A (Solvent B: 
95% acetonitrile, 0.1% TFA). The collected eluent is frozen at -80 °C and lyophilized 
to dryness. Hemolymph samples are then resuspended in a small volume of water 
(usually 200 - 500 µL) and a BCA assay is done to determine protein concentration. 
Following the solid-phase fractionation step, the hemolymph samples are fractionated 
by HPLC and tested by bioassay. 

Induction of M. sexta with B. bassiana and R. solani was performed similarly. 

Corresponding pathogen (M. grisea; B. bassiana; R. solani) induced M. sexta 
cDNA libraries were constructed according to standard protocols. Briefly, total RNA 
was isolated from the fat bodies of pathogen induced M. sexta. The mRNAs were 
isolated using an mRNA purification kit (BRL) according to the manufacturer's 
instructions. The cDNA libraries were constructed using the ZAP-cDNA synthesis kit 
and the pBluescript phagemid (Stratagene). 

Example 3. HPLC-Fractionation of Polypeptides from Magnaporthe grisea Induced 
Manduca sexta Hemolymph 

Hemolymph from M. grisea induced M. sexta larvae (see Example 2) was 
fractionated on an HP-1100 HPLC, using a Vydac C4 (4.6 x 250 mm) column (Figure 3). 
A gradient system was used to elute bound proteins. The gradient conditions are 
indicated below. Fractions were collected at one minute intervals into a 96-well 
microtiter plate and were assayed for fungicidal activity against M. grisea (see 
Example 1). 

This protocol was also followed for fractionation of polypeptides from B. 
bassiana and R. solani induced M. sexta hemolymph. The bioassay for fungicidal 
activity (Example 1) was also conducted using the plant pathogens R. solani and F. 
verticilloides. 

Gradient Conditions: 
Solvents 
Solvent A: 5% Acetonitrile, 0.1% TFA 
Solvent B: 95% Acetonitrile, 0.1% TFA 
Flow rate 
0.6 ml/min 
Gradient 
0-60% B over 70 minutes 
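
As a rough aid to reading these gradient conditions, the programmed solvent composition at a given time can be computed as sketched below. The sketch assumes a strictly linear gradient and one-minute fractions numbered from 1; the function names are illustrative only. It gives the composition programmed at the pump and ignores system dwell volume and column delay, so the composition at which a polypeptide actually elutes will lag somewhat.

```python
# Illustrative sketch: programmed composition of a linear 0-60% B gradient
# over 70 minutes, with Solvent A = 5% and Solvent B = 95% acetonitrile.

def percent_b(t_min, start_b=0.0, end_b=60.0, duration_min=70.0):
    """Programmed %B at time t_min for a linear gradient."""
    if not 0 <= t_min <= duration_min:
        raise ValueError("time outside the gradient window")
    return start_b + (end_b - start_b) * t_min / duration_min


def acetonitrile_percent(pct_b, acn_in_a=5.0, acn_in_b=95.0):
    """Overall % acetonitrile for a given %B mixed with Solvent A."""
    return acn_in_a + (acn_in_b - acn_in_a) * pct_b / 100.0


def fraction_window(fraction_number, interval_min=1.0):
    """Start and end time (min) of a fraction, assuming numbering starts at 1."""
    start = (fraction_number - 1) * interval_min
    return start, start + interval_min


if __name__ == "__main__":
    for t in (47, 52):
        b = percent_b(t)
        print(t, round(b, 1), round(acetonitrile_percent(b), 1))
```

The same mixing arithmetic applies to the solid-phase extraction eluent of Example 2: 60% Solvent B with 40% Solvent A corresponds to 59% acetonitrile overall.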

Example 4. Microbore Purification of the Fungicidal Polypeptide, Mag1 

After fractionation by HPLC, those fractions from Example 3 possessing 
fungicidal activity (47-52 min fractions) were further separated by microbore-LC 
(Michrom BioResources) using a Vydac C4 column (1 x 150 mm). The gradient 
conditions are indicated below. The column eluant was collected in such a manner as 
to best resolve the peaks with the highest polypeptide content (Figure 4). The eluted 
polypeptides were assayed for fungicidal activity against M. grisea (see Example 1). 
The polypeptide fraction containing the greatest fungicidal activity is indicated with 
an arrow. 

Gradient Conditions: 
Solvents 
Solvent A: 5% Acetonitrile, 0.1% TFA 
Solvent B: 95% Acetonitrile, 0.1% TFA 
Flow rate 
50 µl/min 
Gradient 
5-65% solvent B in 70 minutes 

The polypeptide fraction containing the greatest fungicidal activity (indicated 
with an arrow in Figure 4) was further resolved using microbore-LC (Michrom 
BioResources) on a Vydac C18 (1 x 150 mm) column (Figure 5). The gradient 
conditions follow. Again the polypeptide-containing fractions were assayed for 
fungicidal activity against M. grisea (see Example 1). The resulting purified 
polypeptide was designated Mag1. 

Gradient Conditions: 
Solvents 
Solvent A: 5% Acetonitrile, 0.1% HFBA 
Solvent B: 95% Acetonitrile, 0.1% HFBA 
Flow rate 
50 µl/min 
Gradient 
5-65% solvent B in 70 minutes 

This protocol was also followed for microbore purification of fungicidal 
polypeptides identified in B. bassiana and R. solani induced M. sexta hemolymph. 
The bioassay for fungicidal activity (Example 1) was also conducted using the plant 
pathogens R. solani and F. verticilloides. 

Example 5. Molecular Weight Determination of Mag1 
The molecular weight of the isolated Mag1 polypeptide from Example 4 was 
determined using Liquid Chromatography-Mass Spectrometry (LC-MS). The 
molecular mass of Mag1 was determined using electrospray mass spectrometry on a 
Micromass Platform LCZ mass spectrometer (Micromass, Manchester, UK). A 
microbore LC (Michrom BioResources, Auburn, CA) delivered the protein and mobile 
phase (acetonitrile/water) using a reversed phase column. Spectra were obtained in 
positive ion mode using a capillary voltage of 3.5 kV, a cone voltage of 45 V, and a 
source temperature of 90°C. Spectra were scanned over an m/z range of 600-3000 at a 
rate of 3.5 s/scan. Molecular masses were determined using the maximum entropy 
deconvolution algorithm (MaxEnt) to transform the m/z range 600-3000 to give a true 
mass scale spectrum. Mass calibration was performed using horse heart myoglobin. 

A similar protocol was performed for the other polypeptides of the invention. 
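
The relationship between a polypeptide's mass and the multiply charged ions seen in a positive-ion electrospray spectrum can be sketched as below. This is a simple adjacent-peak calculation offered for illustration; it is not the MaxEnt algorithm named above, which is a proprietary maximum entropy method. The myoglobin mass used in the usage lines (about 16951.5 Da) is a commonly quoted average value and should be treated as an assumption here.

```python
# Illustrative sketch: m/z of protonated charge states and a simple
# adjacent-peak mass estimate. Not the MaxEnt deconvolution used in the text.

PROTON = 1.00728  # mass of a proton in Da


def mz(mass, z):
    """m/z of the [M + zH]z+ ion for a neutral mass in Da."""
    return (mass + z * PROTON) / z


def charge_states_in_window(mass, lo=600.0, hi=3000.0, max_z=100):
    """Charge states whose m/z falls inside the scanned window."""
    return [z for z in range(1, max_z + 1) if lo <= mz(mass, z) <= hi]


def mass_from_adjacent_peaks(mz_high, mz_low):
    """Estimate charge and mass from two neighbouring peaks.

    mz_high carries charge z and mz_low carries charge z + 1.
    """
    z = round((mz_low - PROTON) / (mz_high - mz_low))
    return z, z * (mz_high - PROTON)


if __name__ == "__main__":
    myoglobin = 16951.5  # assumed average mass of horse heart myoglobin, Da
    states = charge_states_in_window(myoglobin)
    print(states[0], states[-1])
    print(mass_from_adjacent_peaks(mz(myoglobin, 10), mz(myoglobin, 11)))
```

With the assumed calibrant mass, charge states 6 through 28 fall within the 600-3000 m/z scan range, which illustrates why a single polypeptide appears as a series of peaks that must be transformed onto a true mass scale.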



Example 6. Lys-C Endoproteinase Digestion of Mag1 
Sequencing grade lyophilized endoproteinase Lys-C (Boehringer Mannheim) 
was reconstituted in 50 µl redistilled water resulting in a buffer concentration of 
50 mM Tricine pH 8.0, 10 mM EDTA, and 0.5 mg/ml raffinose. The Mag1 
polypeptide from Example 4 was dissolved in digestion buffer (25 mM Tris-HCl pH 
8.5, 1 mM EDTA) to a ratio of 1:50 Lys-C to Mag1 polypeptide by weight. The 
reaction was allowed to proceed for 20 hours at 37°C. The digested polypeptide was 
fractionated using a C4 column on a microbore-HPLC with a gradient of 5-65% 
acetonitrile in 0.1% TFA over 70 minutes at a flow rate of 50 µl/min (Figure 6). Four 
isolated fragments were collected and submitted for N-terminal sequence analysis. 

A similar protocol was followed for digestion of the other fungicidal 
polypeptides of the invention. 
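
The expected outcome of a Lys-C digest can be previewed in silico, since endoproteinase Lys-C cleaves on the C-terminal side of lysine residues. The sketch below assumes complete, ideal cleavage and uses a made-up sequence; it is not the Mag1 sequence, and the function names are illustrative. For a cysteine-rich polypeptide, fragments joined by disulfide bridges would remain linked unless the sample is reduced, which this sketch does not model.

```python
# Illustrative sketch: ideal Lys-C digestion (cleavage after every K) and
# the enzyme amount for a given weight ratio. The sequence is hypothetical.

def lysc_digest(sequence):
    """Split a one-letter amino acid sequence after each lysine (K)."""
    fragments, start = [], 0
    for i, residue in enumerate(sequence):
        if residue == "K":
            fragments.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        fragments.append(sequence[start:])  # C-terminal fragment without K
    return fragments


def enzyme_mass_ug(substrate_ug, ratio=50.0):
    """Mass of Lys-C for a 1:ratio enzyme-to-substrate weight ratio."""
    return substrate_ug / ratio


if __name__ == "__main__":
    hypothetical = "GSCKRACNPKDECTQKGY"  # made-up sequence for illustration
    print(lysc_digest(hypothetical))
    print(enzyme_mass_ug(10.0))          # enzyme needed for 10 ug substrate
```

A sequence with three internal lysines yields four fragments under these assumptions, and 10 µg of substrate at 1:50 calls for 0.2 µg of enzyme.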


Example 7. N-Terminal Amino Acid Sequence Determination of Mag1 Polypeptide 
Fragments 

The N-termini of the isolated Mag1 fragments from Example 6 were 
sequenced on an ABI Procise® 494 Protein Sequencer, consisting of a chemistry 
workstation, a PTH analysis system, computer control, and automated 
sequence-calling software. Standard protocols were used to run the system and 
determine the sequences (see Figure 7). 

The N-terminal amino acid sequences of isolated fragments of the other 
polypeptides of the invention were determined similarly. 

The N-terminal peptide sequence is critical in determining the precise 
processing site for the conversion of the pro-peptide into the mature and active form 
of the protein (as in this example, Mag1). This is particularly important for secretory 
proteins. 
The C-terminal peptide sequence was deduced from both the molecular weight 
generated by LC-MS of the active protein and the predicted molecular weight of the 
same encoded polypeptide based on the identified cDNA sequence (in Example 8). 

By knowing the precise termini of the mature protein, one can design and 
construct DNA molecules that encode the entire active mature protein for expression 
in plants. When necessary, additional plant specific controlling elements and 
targeting sequences can be tailored and incorporated in the gene design in order to 
enhance and target the expression of the mature polypeptide in plants. 

To ensure that the original specificity and functionality of a protein such as 
Mag1 are retained in the plant, expression of the active mature form of the protein 
in the plant is essential. 






Example 8. Isolation of the cDNA Clone Encoding Mag1 
Fat bodies were harvested directly into liquid nitrogen before processing. 
Total RNA from fat bodies of challenged Manduca sexta was prepared by pulverizing 
the tissue with a mortar and pestle in liquid nitrogen and lysing cells in the presence 
of TRIzol (Life Technologies) according to the manufacturer's protocol. PolyA(+) 
RNA was oligo(dT)-cellulose affinity purified from total RNA using the mRNA 
Purification Kit (Amersham Pharmacia Biotech) following the manufacturer's 
protocol in preparation for cDNA library construction. First strand cDNA synthesis 
using SuperScript II (Life Technologies) and subsequent second strand synthesis, 
linker addition, and directional cloning into restriction sites of pBluescript SK+ 
(Stratagene) were performed according to the instructions provided with the Stratagene 
cDNA kit (Stratagene). cDNA was purified using a cDNA column (Life 
Technologies) immediately prior to ligation into the vector. 

Sequencing of the cDNA library clones was performed using the ABI PRISM 
Big Dye Terminator Cycle Sequencing Ready kit with FS AmpliTaq DNA 
polymerase (Perkin Elmer) and analyzed on an ABI Model 373 Automated DNA 
Sequencer. The Mag1 gene sequence was identified by sequencing about 2000 
clones of the cDNA library prepared from mRNA derived from the fat bodies of 
challenged M. sexta. Amino acid sequences derived from the amino termini of the 
complete peptide or of its proteolytic cleavage products were compared to the 
corresponding cDNA clone sequence library translated in the six possible frames. 
Sequences containing 100% identity to the N-terminal amino acid sequences were 
fully translated and their predicted MW compared to the MW of the purified Mag1 
protein. Sequences with comparable MWs were identified as probably encoding 
Mag1. 
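
The six-frame comparison described above can be sketched as follows. This is a minimal illustration using the standard genetic code and exact substring matching; the clone sequence and peptide tag in the usage lines are made up, and the function names are assumptions introduced here. The subsequent molecular weight comparison is not shown.

```python
# Illustrative sketch: translate a cDNA sequence in all six reading frames
# and report the frames containing an exact match to a peptide tag.
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order ('*' marks stop).
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons; unknown codons (e.g. containing N) become X."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna):
    """Return the six translations keyed as +1, +2, +3, -1, -2, -3."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


def frames_matching_tag(dna, tag):
    """Frames whose translation contains the peptide tag with 100% identity."""
    return [name for name, protein in six_frames(dna).items() if tag in protein]


if __name__ == "__main__":
    clone = "ATGGGTTCTTGTAAATAA"  # hypothetical clone sequence
    print(six_frames(clone)["+1"])
    print(frames_matching_tag(clone, "GSCK"))
```

In practice each sequenced clone would be passed through such a search with every N-terminal tag, and only the matching frames would be carried forward to the molecular weight comparison.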


Example 9. Isolation of the cDNA Clone Encoding a Polypeptide of Interest 
The N-terminal amino acid sequence tags of a polypeptide of interest are used 
to identify cDNA clones encoding the polypeptide. Degenerate oligonucleotides 
encoding the amino acid sequence tags of the polypeptide are used as probes to detect 
cDNAs encoding the polypeptide in a pathogen induced M. sexta cDNA library (see 
Example 2). In this manner a full-length cDNA encoding the polypeptide of interest 
is isolated and sequenced. Complete sequencing of the identified cDNA clone is 
performed to confirm that it encodes the purified polypeptide. Confirmation is 
provided by the predicted molecular weight of the cDNA encoded polypeptide being 
the same as the molecular weight of the polypeptide generated by LC-MS. 
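
When designing degenerate oligonucleotide probes from an amino acid sequence tag, it can help to estimate how many distinct nucleotide sequences would be needed to cover every possible coding of the tag. The sketch below counts codons per residue under the standard genetic code and picks the least degenerate window of a given length. The tag shown is hypothetical and the function names are illustrative; codon usage bias in M. sexta is not considered.

```python
# Illustrative sketch: degeneracy of an oligonucleotide pool encoding a
# peptide tag, using codon counts from the standard genetic code.
from math import prod

CODONS_PER_AA = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def probe_degeneracy(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(CODONS_PER_AA[aa] for aa in peptide)


def least_degenerate_window(peptide, length):
    """Return (start_index, window, degeneracy) for the best window."""
    if length > len(peptide):
        raise ValueError("window longer than peptide")
    windows = [(i, peptide[i:i + length])
               for i in range(len(peptide) - length + 1)]
    start, window = min(windows, key=lambda w: probe_degeneracy(w[1]))
    return start, window, probe_degeneracy(window)


if __name__ == "__main__":
    tag = "LSRMWKDEC"  # hypothetical N-terminal tag
    print(probe_degeneracy(tag))
    print(least_degenerate_window(tag, 6))
```

Windows rich in methionine and tryptophan, which each have a single codon, give the smallest pools, whereas leucine, serine, and arginine each multiply the pool size sixfold.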

Example 10. Construction of Recombinant Baculovirus Expressing Fungicidal 
Polypeptides 

The nucleotide sequences encoding the polypeptides of the invention may be 
introduced into the baculovirus genome itself. For this purpose the nucleotide 
sequences may be placed under the control of the polyhedrin promoter, the IE1 
promoter, or any other one of the baculovirus promoters. The cDNA, together with 
appropriate leader sequences, is then inserted into a baculovirus transfer vector using 
standard molecular cloning techniques. Following transformation of E. coli DH5α, 
isolated colonies are chosen and plasmid DNA is prepared and is analyzed by 
restriction enzyme analysis. Colonies containing the appropriate fragment are 
isolated, propagated, and plasmid DNA is prepared for cotransfection. 

Example 11. Expression of Fungicidal Polypeptides in Insect Cells 
The polypeptides of the invention may be expressed in insect cells. For this 
purpose the Spodoptera frugiperda cells (Sf-9 or Sf-21) are propagated in 
ExCell® 401 media (JRH Biosciences, Lenexa, KS), or a similar media, supplemented 
with 3.0% fetal bovine serum. Lipofectin® (50 µL at 0.1 mg/mL, Gibco/BRL) is added 
to a 50 µL aliquot of the transfer vector containing the antimicrobial nucleotide 
sequences (500 ng) and linearized polyhedrin-negative AcNPV (2.5 µg, BaculoGold® 
viral DNA, PharMingen, San Diego, CA). Sf-9 cells (approximately 50% monolayer) are 
co-transfected with the viral DNA/transfer vector solution. The supernatant fluid 
from the co-transfection experiment is collected at 5 days post-transfection and 
recombinant viruses are isolated employing standard plaque purification protocols, 
wherein only polyhedrin-positive plaques are selected (O'Reilly et al. (1992), 
Baculovirus Expression Vectors: A Laboratory Manual, W. H. Freeman and 
Company, New York). Sf-9 cells in 35 mm petri dishes (50% monolayer) are 
inoculated with 100 µL of a serial dilution of the viral suspension, and supernatant 
fluids are collected at 5 days post infection. In order to prepare larger quantities of 
virus for characterization, these supernatant fluids are used to inoculate larger tissue 
cultures for large scale propagation of recombinant viruses. Expression of the 
encoded fungicidal polypeptide by the recombinant baculovirus can be confirmed 
using a bioassay (such as described in Example 4), LC-MS, or antibodies. 



Example 12. Expression of Fungicidal Peptides in Pichia 
The nucleotide sequences encoding the polypeptides of the invention may be 
expressed in Pichia under constitutive or inducible promoter control and targeted to 
remain intracellular or to be secreted into the media. The nucleotide sequences are 
cloned into a Pichia expression vector using standard molecular techniques. 
Transformation of Pichia strains (e.g., X-33, GS115, SMD1168, KM71, etc.; 
Invitrogen, Carlsbad, CA) involves linearization of the construct and introduction of 
the DNA into transformation competent Pichia cells by chemical means or by 
electroporation according to standard protocols. Transformants are selected by either 
resistance to Zeocin or blasticidin or by their ability to grow on histidine-deficient 
medium. Small scale expression tests are performed on selected transformants to 
identify high expressors of the polypeptides of the invention for additional scale up. 
In an inducible system, such as when the peptide is under control of the AOX1 
promoter, transformants are grown in media with glycerol as a carbon source and 
induced by growth in media containing methanol instead of glycerol. Continuous 
induction over a period of 24 - 120 hrs is achieved by addition of methanol (0.5% 
final conc.) every 24 hr. Functional expression of the polypeptide is confirmed by 
LC-MS analysis/purification and bioassay. 
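
The methanol feeding regimen can be laid out as a simple schedule. The sketch below rests on several assumptions not stated in the text: that the 0.5% final concentration is volume per volume, that 100% methanol is added, that induction begins at time zero on transfer to methanol medium, that feeds occur at each 24 hour mark before harvest, and that changes in culture volume are negligible. The function name is illustrative.

```python
# Illustrative sketch: methanol feed schedule for AOX1-driven induction.
# Assumes v/v percentages, pure methanol, and a constant culture volume.

def methanol_feed_schedule(induction_hours, culture_ml,
                           final_pct=0.5, interval_h=24):
    """Return a list of (time_h, methanol_ml) feeds before harvest."""
    if not 24 <= induction_hours <= 120:
        raise ValueError("induction period outside the 24-120 h range")
    dose_ml = culture_ml * final_pct / 100.0
    return [(t, dose_ml) for t in range(interval_h, induction_hours, interval_h)]


if __name__ == "__main__":
    for time_h, dose in methanol_feed_schedule(120, 100.0):
        print(time_h, dose)
```

For a 100 ml culture induced for 120 hours, these assumptions give four feeds of 0.5 ml each at 24, 48, 72, and 96 hours.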



Example 13. Expression of Fungicidal Polypeptides in Bacteria 
The nucleotide sequences encoding the polypeptides of the invention may be 
expressed in bacteria and the peptides targeted for intracellular or extracellular 
expression. The cDNAs may be cloned into a suitable bacterial expression vector 
(e.g., pET vectors (Novagen, Madison, WI)) under constitutive or inducible promoter 
control using standard molecular cloning techniques. The plasmid containing the 
gene of interest is introduced into transformation competent bacteria cells using 
standard protocols for chemical transformation or electroporation and the 
transformants are selected using antibiotic resistance. In addition to traditional E. coli 
strains commonly used for transformation, mutant strains such as Origami™ 
(Novagen) that are permissive for disulfide bond formation can be used, especially 
with cysteine-rich peptides to express functional peptides. Inducible systems such as 
E. coli strains bearing the T7 RNA polymerase gene (lambda-DE3 lysogen) can be 
used in which expression of the gene of interest under a T7 promoter is induced by 
addition of IPTG for variable periods of time. Expression and activity of the 
polypeptides are confirmed by LC-MS and bioassays. 


Example 14. Transformation of Rice Embryogenic Callus by Bombardment and 
Regeneration of Transgenic Plants 
Embryogenic callus cultures derived from the scutellum of germinating seeds 
serve as the source material for transformation experiments. This material is 
generated by germinating sterile rice seeds on a callus initiation media (MS salts, 
Nitsch and Nitsch vitamins, 1.0 mg/l 2,4-D and 10 µM AgNO3) in the dark at 27- 
28° C. Embryogenic callus proliferating from the scutellum of the embryos is then 
transferred to CM media (N6 salts, Nitsch and Nitsch vitamins, 1 mg/l 2,4-D, Chu et 
al., 1985, Sci. Sinica 18:659-668). Callus cultures are maintained on CM by routine 
sub-culture at two week intervals and used for transformation within 10 weeks of 
initiation. 

Callus is prepared for transformation by subculturing 0.5-1.0 mm pieces 
approximately 1 mm apart, arranged in a circular area of about 4 cm in diameter, in 
the center of a circle of Whatman #541 paper placed on CM media. The plates with 
callus are incubated in the dark at 27-28° C for 3-5 days. Prior to bombardment, the 
filters with callus are transferred to CM supplemented with 0.25 M mannitol and 0.25 
M sorbitol for 3 hours in the dark. The petri dish lids are then left ajar for 20-45 
minutes in a sterile hood to allow moisture on tissue to dissipate. 

Circular plasmid DNA from two different plasmids, one containing the 
selectable marker for rice transformation and one containing the nucleotide sequence 
of the invention, is co-precipitated onto the surface of gold particles. To accomplish 
this, a total of 10 µg of DNA at a 2:1 ratio of trait:selectable marker DNAs is added to 
a 50 µl aliquot of gold particles resuspended at a concentration of 60 mg/ml. Calcium 
chloride (50 µl of a 2.5 M solution) and spermidine (20 µl of a 0.1 M solution) are 
then added to the gold-DNA suspension as the tube is vortexing for 3 minutes. The 
gold particles are centrifuged in a microfuge for 1 sec and the supernatant removed. 
The gold particles are then washed twice with 1 ml of absolute ethanol and then 
resuspended in 50 µl of absolute ethanol and sonicated (bath sonicator) for one second 
to disperse the gold particles. The gold suspension is incubated at -70° C for five 
minutes and sonicated (bath sonicator) if needed to disperse the particles. Six 
microliters of the DNA-coated gold particles are then loaded onto mylar macrocarrier 
disks and the ethanol is allowed to evaporate. 
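
The quantities in the particle preparation can be worked through as below. The per-shot DNA figure is an upper bound that assumes all of the DNA precipitates onto the gold and that no gold is lost during washing; neither assumption is stated in the text, and the function names are illustrative.

```python
# Illustrative sketch: DNA and gold amounts for the co-precipitation step.
# Per-shot values assume complete DNA binding and no particle loss.

def dna_split(total_ug=10.0, trait_parts=2, marker_parts=1):
    """Split total DNA between trait and selectable marker plasmids."""
    parts = trait_parts + marker_parts
    return total_ug * trait_parts / parts, total_ug * marker_parts / parts


def gold_mass_mg(volume_ul=50.0, conc_mg_per_ml=60.0):
    """Mass of gold in the aliquot."""
    return volume_ul / 1000.0 * conc_mg_per_ml


def per_shot(total_dna_ug=10.0, gold_mg=3.0, resuspension_ul=50.0, load_ul=6.0):
    """DNA (ug, upper bound) and gold (mg) loaded onto one macrocarrier."""
    fraction = load_ul / resuspension_ul
    return total_dna_ug * fraction, gold_mg * fraction


if __name__ == "__main__":
    trait_ug, marker_ug = dna_split()
    print(round(trait_ug, 2), round(marker_ug, 2))
    print(gold_mass_mg())
    print(per_shot(gold_mg=gold_mass_mg()))
```

Under these assumptions the 10 µg of DNA divides into about 6.7 µg of trait plasmid and 3.3 µg of marker plasmid on 3 mg of gold, and each 6 µl loading carries at most about 1.2 µg of DNA on 0.36 mg of gold.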

At the end of the drying period, a petri dish containing the tissue is placed in 
the chamber of the PDS-1000/He. The air in the chamber is then evacuated to a 
vacuum of 28-29 inches Hg. The macrocarrier is accelerated with a helium shock 
wave using a rupture membrane that bursts when the He pressure in the shock tube 
reaches 1080-1100 p.s.i. The tissue is placed approximately 8 cm from the stopping 
screen and the callus is bombarded two times. Five to seven plates of tissue are 
bombarded in this way with the DNA-coated gold particles. Following bombardment, 
the callus tissue is transferred to CM media without supplemental sorbitol or 
mannitol. 

Within 3-5 days after bombardment the callus tissue is transferred to SM 
media (CM medium containing 50 mg/l hygromycin). To accomplish this, callus 
tissue is transferred from plates to sterile 50 ml conical tubes and weighed. Molten 
top-agar at 40° C is added using 2.5 ml of top agar/100 mg of callus. Callus clumps 
are broken into fragments of less than 2 mm diameter by repeated dispensing through 
a 10 ml pipette. Three milliliter aliquots of the callus suspension are plated onto fresh 
SM media and the plates incubated in the dark for 4 weeks at 27-28° C. After 4 
weeks, transgenic callus events are identified, transferred to fresh SM plates and 
grown for an additional 2 weeks in the dark at 27-28° C. 
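
The top-agar step scales with the weighed callus mass. The Python sketch below illustrates the arithmetic; the 600 mg callus mass is a hypothetical input, the function names are not from the specification, and the suspension volume is approximated by the agar volume alone (the volume of the callus itself is neglected).

```python
import math


def top_agar_ml(callus_mg, ml_per_100_mg=2.5):
    """Volume of molten top agar for a given callus mass (2.5 ml per 100 mg)."""
    return callus_mg / 100.0 * ml_per_100_mg


def plates_needed(suspension_ml, aliquot_ml=3.0):
    """Number of SM plates when the suspension is plated in 3 ml aliquots."""
    return math.ceil(suspension_ml / aliquot_ml)


callus_mg = 600.0                 # hypothetical weighed callus mass
agar_ml = top_agar_ml(callus_mg)
plates = plates_needed(agar_ml)
print(f"{callus_mg:.0f} mg callus: {agar_ml:.1f} ml top agar, {plates} plates")
```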

Growing callus is transferred to RM1 media (MS salts, Nitsch and Nitsch 
vitamins, 2% sucrose, 3% sorbitol, 0.4% gelrite + 50 ppm hyg B) for 2 weeks in the 
dark at 25° C. After 2 weeks the callus is transferred to RM2 media (MS salts, Nitsch 
and Nitsch vitamins, 3% sucrose, 0.4% gelrite + 50 ppm hyg B) and placed under cool 
white light (~40 µE m^-2 s^-1) with a 12 hr photoperiod at 25° C and 30-40% humidity. 

After 2-4 weeks in the light, callus generally begins to organize and form shoots. 
Shoots are removed from surrounding callus/media and gently transferred to RM3 
media (1/2 x MS salts, Nitsch and Nitsch vitamins, 1% sucrose + 50 ppm hygromycin 
B) in phytatrays (Sigma Chemical Co., St. Louis, MO) and incubation is continued 
using the same conditions as described in the previous step. 

Plants are transferred from RM3 to 4" pots containing Metro mix 350 after 2-3 
weeks, when sufficient root and shoot growth has occurred. Plants are grown using a 
12 hr/12 hr light/dark cycle using a ~30/18° C day/night temperature regimen. 
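
The culture stages described above add up to a fairly long schedule. The following Python sketch totals the stated durations from bombardment to potting; the stage labels and the data layout are illustrative choices, and stages are assumed to run back to back with no gaps.

```python
# (stage, minimum days, maximum days), taken from the durations stated above
STAGES = [
    ("CM medium after bombardment", 3, 5),
    ("SM selection, first round", 28, 28),
    ("SM selection, second round", 14, 14),
    ("RM1, dark", 14, 14),
    ("RM2, light, until shoots form", 14, 28),
    ("RM3 in phytatrays", 14, 21),
]


def total_days(stages):
    """Return (shortest, longest) total duration in days."""
    return sum(s[1] for s in stages), sum(s[2] for s in stages)


low, high = total_days(STAGES)
print(f"bombardment to potting: {low}-{high} days "
      f"(about {low / 7:.0f}-{high / 7:.0f} weeks)")
```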



Example 15. Transformation of Maize by Particle Bombardment and Regeneration of 
Transgenic Plants 

Immature maize embryos from greenhouse donor plants are bombarded with a 
plasmid containing a nucleotide sequence of the invention operably linked to a 
ubiquitin promoter and the selectable marker gene PAT (Wohlleben et al. (1988) 
Gene 70:25-37), which confers resistance to the herbicide Bialaphos. Alternatively, 
the selectable marker gene is provided on a separate plasmid. Transformation is 
performed as follows. Media recipes follow below. 

Preparation of Target Tissue 

The ears are husked and surface sterilized in 30% Clorox bleach plus 0.5% 
Micro detergent for 20 minutes, and rinsed two times with sterile water. The 
immature embryos are excised and placed embryo axis side down (scutellum side up), 
25 embryos per plate, on 560Y medium for 4 hours and then aligned within the 2.5-cm 
target zone in preparation for bombardment. 

Preparation of DNA 

A plasmid vector comprising the nucleotide sequence of the invention 
operably linked to a ubiquitin promoter is made. This plasmid DNA plus plasmid 
DNA containing a PAT selectable marker is precipitated onto 1.1 µm (average 
diameter) tungsten pellets using a CaCl2 precipitation procedure as follows: 

100 µl prepared tungsten particles in water 
10 µl (1 µg) DNA in Tris EDTA buffer (1 µg total DNA) 
100 µl 2.5 M CaCl2 
10 µl 0.1 M spermidine 

Each reagent is added sequentially to the tungsten particle suspension, while 
maintained on the multitube vortexer. The final mixture is sonicated briefly and 
allowed to incubate under constant vortexing for 10 minutes. After the precipitation 
period, the tubes are centrifuged briefly, the liquid is removed, and the pellet is washed 
with 500 ml 100% ethanol and centrifuged for 30 seconds. Again the liquid is removed, 
and 105 µl 100% ethanol is added to the final tungsten particle pellet. For particle gun 
bombardment, the tungsten/DNA particles are briefly sonicated and 10 µl spotted 
onto the center of each macrocarrier and allowed to dry about 2 minutes before 
bombardment. 
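
The final reagent concentrations in the precipitation mix follow from the volumes listed above. The Python sketch below works them out; the names are hypothetical, volumes are assumed to be additive, and the per-shot DNA figure assumes all DNA binds the particles and stays evenly suspended, so it is an upper bound rather than a measured value.

```python
# Volumes (ul) and stock molarities as listed in the precipitation procedure
COMPONENTS_UL = {"tungsten": 100.0, "dna": 10.0, "cacl2": 100.0, "spermidine": 10.0}
STOCK_M = {"cacl2": 2.5, "spermidine": 0.1}


def final_molarity(name):
    """Molarity of a reagent after all components are combined."""
    total_ul = sum(COMPONENTS_UL.values())
    return STOCK_M[name] * COMPONENTS_UL[name] / total_ul


def dna_per_shot_ug(dna_ug=1.0, final_volume_ul=105.0, shot_ul=10.0):
    """Upper-bound DNA mass delivered per 10 ul spotted on a macrocarrier."""
    return dna_ug * shot_ul / final_volume_ul


print(f"CaCl2: {final_molarity('cacl2'):.2f} M")
print(f"spermidine: {final_molarity('spermidine') * 1000:.1f} mM")
print(f"DNA per shot: {dna_per_shot_ug() * 1000:.0f} ng")
```

A 105 µl final volume spotted in 10 µl portions is consistent with the ten aliquots per tube mentioned under Particle Gun Treatment.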






Particle Gun Treatment 

The sample plates are bombarded at level #4 in particle gun #HE34-1 or 
#HE34-2. All samples receive a single shot at 650 PSI, with a total of ten aliquots 
taken from each tube of prepared particles/DNA. 



Subsequent Treatment 

Following bombardment, the embryos are kept on 560Y medium for 2 days, 
then transferred to 560R selection medium containing 3 mg/liter Bialaphos, and 
subcultured every 2 weeks. After approximately 10 weeks of selection, selection-resistant 
callus clones are transferred to 288J medium to initiate plant regeneration. 
Following somatic embryo maturation (2-4 weeks), well-developed somatic embryos 
are transferred to medium for germination and transferred to the lighted culture room. 
Approximately 7-10 days later, developing plantlets are transferred to 272V hormone-free 
medium in tubes for 7-10 days until plantlets are well established. Plants are then 
transferred to inserts in flats (equivalent to 2.5" pot) containing potting soil and grown 
for 1 week in a growth chamber, subsequently grown an additional 1-2 weeks in the 
greenhouse, then transferred to classic 600 pots (1.6 gallon) and grown to maturity. 
Plants are monitored and scored for expression of the nucleotide sequence encoding 
the fungicidal polypeptide of the invention, or for the presence of the fungicidal 
polypeptide by immunological methods, or for fungicidal activity by assays known in 
the art, described supra herein. 


Bombardment and Culture Media 

Bombardment medium (560Y) comprises 4.0 g/l N6 basal salts (SIGMA C-1416), 
1.0 ml/l Eriksson's Vitamin Mix (1000X SIGMA-1511), 0.5 mg/l thiamine HCl, 
120.0 g/l sucrose, 1.0 mg/l 2,4-D, and 2.88 g/l L-proline (brought to volume with D-I 
H2O following adjustment to pH 5.8 with KOH); 2.0 g/l Gelrite (added after bringing 
to volume with D-I H2O); and 8.5 mg/l silver nitrate (added after sterilizing the 
medium and cooling to room temperature). Selection medium (560R) comprises 
4.0 g/l N6 basal salts (SIGMA C-1416), 1.0 ml/l Eriksson's Vitamin Mix (1000X 
SIGMA-1511), 0.5 mg/l thiamine HCl, 30.0 g/l sucrose, and 2.0 mg/l 2,4-D (brought to 
volume with D-I H2O following adjustment to pH 5.8 with KOH); 3.0 g/l Gelrite 
(added after bringing to volume with D-I H2O); and 0.85 mg/l silver nitrate and 3.0 
mg/l bialaphos (both added after sterilizing the medium and cooling to room 
temperature). 

Plant regeneration medium (288J) comprises 4.3 g/l MS salts (GIBCO 11117-074), 
5.0 ml/l MS vitamins stock solution (0.100 g/l nicotinic acid, 0.02 g/l thiamine 
HCl, 0.10 g/l pyridoxine HCl, and 0.40 g/l glycine brought to volume with polished 
D-I H2O) (Murashige and Skoog (1962) Physiol. Plant. 15:473), 100 mg/l myo-inositol, 
0.5 mg/l zeatin, 60 g/l sucrose, and 1.0 ml/l of 0.1 mM abscisic acid (brought to 
volume with polished D-I H2O after adjusting to pH 5.6); 3.0 g/l Gelrite (added after 
bringing to volume with D-I H2O); and 1.0 mg/l indoleacetic acid and 3.0 mg/l 
bialaphos (added after sterilizing the medium and cooling to 60° C). Hormone-free 
medium (272V) comprises 4.3 g/l MS salts (GIBCO 11117-074), 5.0 ml/l MS vitamins 
stock solution (0.100 g/l nicotinic acid, 0.02 g/l thiamine HCl, 0.10 g/l pyridoxine 
HCl, and 0.40 g/l glycine brought to volume with polished D-I H2O), 0.1 g/l myo-inositol, 
and 40.0 g/l sucrose (brought to volume with polished D-I H2O after adjusting 
pH to 5.6); and 6 g/l bacto-agar (added after bringing to volume with polished D-I 
H2O), sterilized and cooled to 60° C. 
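
Because the media above are specified per liter, preparing a different batch size is a simple scaling. The Python sketch below shows one way to represent the 560R selection medium and scale it; the dictionary layout and function name are illustrative, and the sketch does not capture the order of addition (Gelrite after bringing to volume, silver nitrate and bialaphos after sterilization).

```python
# 560R selection medium, amounts per liter as given in the text
RECIPE_560R = {
    "N6 basal salts": (4.0, "g"),
    "Eriksson's Vitamin Mix (1000X)": (1.0, "ml"),
    "thiamine HCl": (0.5, "mg"),
    "sucrose": (30.0, "g"),
    "2,4-D": (2.0, "mg"),
    "Gelrite": (3.0, "g"),
    "silver nitrate": (0.85, "mg"),
    "bialaphos": (3.0, "mg"),
}


def scale(recipe, litres):
    """Scale per-liter amounts to a batch of the given volume."""
    return {name: (amount * litres, unit) for name, (amount, unit) in recipe.items()}


batch = scale(RECIPE_560R, 0.5)  # a hypothetical 500 ml batch
for name, (amount, unit) in batch.items():
    print(f"{name}: {amount:g} {unit}")
```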

Example 16. Agrobacterium-Mediated Transformation of Maize and Regeneration of 
Transgenic Plants 

For Agrobacterium-mediated transformation of maize with a plant-optimized 
nucleotide sequence of the invention, preferably the method of Zhao is employed 
(U.S. Patent No. 5,981,840, and PCT patent publication WO98/32326; the contents of 
which are hereby incorporated by reference). Briefly, immature embryos are isolated 
from maize and the embryos contacted with a suspension of Agrobacterium, where 
the bacteria are capable of transferring the plant-optimized nucleotide sequence of the 
invention to at least one cell of at least one of the immature embryos (step 1: the 
infection step). In this step the immature embryos are preferably immersed in an 
Agrobacterium suspension for the initiation of inoculation. The embryos are co-cultured 
for a time with the Agrobacterium (step 2: the co-cultivation step). 
Preferably the immature embryos are cultured on solid medium following the 
infection step. Following this co-cultivation period an optional "resting" step is 
contemplated. In this resting step, the embryos are incubated in the presence of at 
least one antibiotic known to inhibit the growth of Agrobacterium without the 
addition of a selective agent for plant transformants (step 3: the resting step). Preferably 
the immature embryos are cultured on solid medium with antibiotic, but without a 
selecting agent, for elimination of Agrobacterium and for a resting phase for the 
infected cells. Next, inoculated embryos are cultured on medium containing a 
selective agent and growing transformed callus is recovered (step 4: the selection 
step). Preferably, the immature embryos are cultured on solid medium with a 
selective agent resulting in the selective growth of transformed cells. The callus is 
then regenerated into plants (step 5: the regeneration step), and preferably calli grown 
on selective medium are cultured on solid medium to regenerate the plants. 

Example 17. Transformation of Soybean Embryos and Regeneration of Transgenic 
Plants 

Soybean embryos are bombarded with a plasmid containing a nucleotide 
sequence of the invention operably linked to a ubiquitin promoter as follows. To 
induce somatic embryos, cotyledons, 3-5 mm in length dissected from surface-sterilized, 
immature seeds of the soybean cultivar A2872, are cultured in the light or 
dark at 26°C on an appropriate agar medium for six to ten weeks. Somatic embryos 
producing secondary embryos are then excised and placed into a suitable liquid 
medium. After repeated selection for clusters of somatic embryos that multiplied as 
early, globular-staged embryos, the suspensions are maintained as described below. 

Soybean embryogenic suspension cultures can be maintained in 35 ml liquid media 
on a rotary shaker, 150 rpm, at 26°C with fluorescent lights on a 16:8 hour day/night 
schedule. Cultures are subcultured every two weeks by inoculating approximately 
35 mg of tissue into 35 ml of liquid medium. 

Soybean embryogenic suspension cultures may then be transformed by the 
method of particle gun bombardment (Klein et al. (1987) Nature (London) 327:70-73, 
U.S. Patent No. 4,945,050). A Du Pont Biolistic PDS1000/HE instrument (helium 
retrofit) can be used for these transformations. 

A selectable marker gene that can be used to facilitate soybean transformation is 
a transgene composed of the 35S promoter from Cauliflower Mosaic Virus (Odell et 
al. (1985) Nature 313:810-812), the hygromycin phosphotransferase gene from 
plasmid pJR225 (from E. coli; Gritz et al. (1983) Gene 25:179-188), and the 3' region 
of the nopaline synthase gene from the T-DNA of the Ti plasmid of Agrobacterium 
tumefaciens. The expression cassette comprising the nucleotide sequence of the 
invention operably linked to the ubiquitin promoter can be isolated as a restriction 
fragment. This fragment can then be inserted into a unique restriction site of the 
vector carrying the marker gene. 

To 50 µl of a 60 mg/ml 1 µm gold particle suspension is added (in order): 5 µl 
DNA (1 µg/µl), 20 µl spermidine (0.1 M), and 50 µl CaCl2 (2.5 M). The particle 
preparation is then agitated for three minutes, spun in a microfuge for 10 seconds and 
the supernatant removed. The DNA-coated particles are then washed once in 400 µl 
70% ethanol and resuspended in 40 µl of anhydrous ethanol. The DNA/particle 
suspension can be sonicated three times for one second each. Five microliters of the 
DNA-coated gold particles are then loaded on each macro carrier disk. 
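
The per-disk loading implied by this preparation can be estimated as follows. This Python sketch is illustrative: the function name is hypothetical, volumes are assumed additive, and the DNA-per-disk figure assumes complete binding and no loss during the wash, so it should be read as an upper bound.

```python
def soybean_particle_prep():
    """Work out mix concentrations and per-disk loads from the stated volumes."""
    gold_mg = 50.0 / 1000.0 * 60.0          # 50 ul of a 60 mg/ml suspension
    dna_ug = 5.0 * 1.0                      # 5 ul at 1 ug/ul
    mix_ul = 50.0 + 5.0 + 20.0 + 50.0       # gold + DNA + spermidine + CaCl2
    cacl2_m = 2.5 * 50.0 / mix_ul           # final CaCl2 molarity
    spermidine_mm = 0.1 * 20.0 / mix_ul * 1000.0
    disks = int(40.0 // 5.0)                # 40 ul resuspension, 5 ul per disk
    return {
        "cacl2_M": cacl2_m,
        "spermidine_mM": spermidine_mm,
        "disks": disks,
        "dna_ug_per_disk": dna_ug / disks,
        "gold_mg_per_disk": gold_mg / disks,
    }


prep = soybean_particle_prep()
for key, value in prep.items():
    print(f"{key}: {value:g}")
```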

Approximately 300-400 mg of a two-week-old suspension culture is placed in 
an empty 60x15 mm petri dish and the residual liquid removed from the tissue with a 
pipette. For each transformation experiment, approximately 5-10 plates of tissue are 
normally bombarded. Membrane rupture pressure is set at 1100 psi, and the chamber 
is evacuated to a vacuum of 28 inches mercury. The tissue is placed approximately 
3.5 inches away from the retaining screen and bombarded three times. Following 
bombardment, the tissue can be divided in half and placed back into liquid and 
cultured as described above. 

Five to seven days post-bombardment, the liquid media may be exchanged with 
fresh media, and eleven to twelve days post-bombardment with fresh media 
containing 50 mg/ml hygromycin. This selective media can be refreshed weekly. 
Seven to eight weeks post-bombardment, green, transformed tissue may be observed 
growing from untransformed, necrotic embryogenic clusters. Isolated green tissue is 
removed and inoculated into individual flasks to generate new, clonally propagated, 
transformed embryogenic suspension cultures. Each new line may be treated as an 
independent transformation event. These suspensions can then be subcultured and 
maintained as clusters of immature embryos or regenerated into whole plants by 
maturation and germination of individual somatic embryos. 

Example 18. Transformation of Sunflower Meristem Tissue and Regeneration of 
Transgenic Plants 

Sunflower meristem tissues are transformed with an expression cassette 
containing the nucleotide sequence of the invention operably linked to a ubiquitin 
promoter as follows (see also European Patent Number EP 0 486233, herein 
incorporated by reference, and Malone-Schoneberg et al. (1994) Plant Science 
103:199-207). Mature sunflower seed (Helianthus annuus L.) are dehulled using a 
single wheat-head thresher. Seeds are surface sterilized for 30 minutes in a 20% 
Clorox bleach solution with the addition of two drops of Tween 20 per 50 ml of 
solution. The seeds are rinsed twice with sterile distilled water. 

Split embryonic axis explants are prepared by a modification of procedures 
described by Schrammeijer et al. (Schrammeijer et al. (1990) Plant Cell Rep. 9:55-60). 
Seeds are imbibed in distilled water for 60 minutes following the surface 
sterilization procedure. The cotyledons of each seed are then broken off, producing a 
clean fracture at the plane of the embryonic axis. Following excision of the root tip, 
the explants are bisected longitudinally between the primordial leaves. The two 
halves are placed, cut surface up, on GBA medium consisting of Murashige and 
Skoog mineral elements (Murashige et al. (1962) Physiol. Plant. 15:473-497), 
Shepard's vitamin additions (Shepard (1980) in Emergent Techniques for the Genetic 
Improvement of Crops (University of Minnesota Press, St. Paul, Minnesota)), 40 mg/l 
adenine sulfate, 30 g/l sucrose, 0.5 mg/l 6-benzyl-aminopurine (BAP), 0.25 mg/l 
indole-3-acetic acid (IAA), 0.1 mg/l gibberellic acid (GA3), pH 5.6, and 8 g/l 
Phytagar. 

The explants are subjected to microprojectile bombardment prior to 
Agrobacterium treatment (Bidney et al. (1992) Plant Mol. Biol. 18:301-313). Thirty 
to forty explants are placed in a circle at the center of a 60 X 20 mm plate for this 
treatment. Approximately 4.7 mg of 1.8 µm tungsten microprojectiles are resuspended 
in 25 ml of sterile TE buffer (10 mM Tris HCl, 1 mM EDTA, pH 8.0) and 1.5 ml 
aliquots are used per bombardment. Each plate is bombarded twice through a 150 
mm nytex screen placed 2 cm above the samples in a PDS 1000® particle 
acceleration device. 

Disarmed Agrobacterium tumefaciens strain EHA105 is used in all 
transformation experiments. A binary plasmid vector comprising the expression 
cassette that contains the nucleotide sequence of the invention operably linked to the 
ubiquitin promoter is introduced into Agrobacterium strain EHA105 via freeze-thawing 
as described by Holsters et al. (1978) Mol. Gen. Genet. 163:181-187. This 
plasmid further comprises a kanamycin selectable marker gene (i.e., nptII). Bacteria 
for plant transformation experiments are grown overnight (28°C and 100 RPM 
continuous agitation) in liquid YEP medium (10 gm/l yeast extract, 10 gm/l 
Bactopeptone, and 5 gm/l NaCl, pH 7.0) with the appropriate antibiotics required for 
bacterial strain and binary plasmid maintenance. The suspension is used when it 
reaches an OD600 of about 0.4 to 0.8. The Agrobacterium cells are pelleted and 
resuspended at a final OD600 of 0.5 in an inoculation medium comprised of 12.5 mM 
MES pH 5.7, 1 gm/l NH4Cl, and 0.3 gm/l MgSO4. 
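
The resuspension volume needed to reach the target OD600 can be estimated with a simple dilution relation. The Python sketch below is a rough guide only: the function name and the 10 ml culture volume are hypothetical, and it assumes OD600 is proportional to cell density over this range and that all cells are recovered in the pellet. In practice the OD of the resuspended cells would be measured and adjusted.

```python
def resuspension_volume_ml(culture_ml, culture_od, target_od=0.5):
    """Volume of inoculation medium giving the target OD600 (C1*V1 = C2*V2)."""
    if target_od <= 0:
        raise ValueError("target OD must be positive")
    return culture_ml * culture_od / target_od


# Hypothetical 10 ml overnight culture at each end of the stated OD600 range
for od in (0.4, 0.8):
    vol = resuspension_volume_ml(10.0, od)
    print(f"culture OD600 {od}: resuspend pellet in about {vol:.0f} ml")
```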

Freshly bombarded explants are placed in an Agrobacterium suspension, 
mixed, and left undisturbed for 30 minutes. The explants are then transferred to GBA 
medium and co-cultivated, cut surface down, at 26°C and 18-hour days. After three 
days of co-cultivation, the explants are transferred to 374B (GBA medium lacking 
growth regulators and a reduced sucrose level of 1%) supplemented with 250 mg/l 
cefotaxime and 50 mg/l kanamycin sulfate. The explants are cultured for two to five 
weeks on selection and then transferred to fresh 374B medium lacking kanamycin for 
one to two weeks of continued development. Explants with differentiating, antibiotic-resistant 
areas of growth that have not produced shoots suitable for excision are 
transferred to GBA medium containing 250 mg/l cefotaxime for a second 3-day 
phytohormone treatment. Leaf samples from green, kanamycin-resistant shoots are 
assayed for the presence of NPTII by ELISA and for the presence of transgene 
expression by assaying for expression of the nucleotide sequence encoding the 
fungicidal polypeptide of the invention, the presence of the fungicidal polypeptide by 
immunological methods, or for fungicidal activity by assays known in the art, 
described supra herein. 

NPTII-positive shoots are grafted to Pioneer® hybrid 6440 in vitro-grown 
sunflower seedling rootstock. Surface sterilized seeds are germinated in 48-0 medium 
(half-strength Murashige and Skoog salts, 0.5% sucrose, 0.3% gelrite, pH 5.6) and 
grown under conditions described for explant culture. The upper portion of the 
seedling is removed, a 1 cm vertical slice is made in the hypocotyl, and the 
transformed shoot inserted into the cut. The entire area is wrapped with parafilm to 
secure the shoot. Grafted plants can be transferred to soil following one week of in 
vitro culture. Grafts in soil are maintained under high humidity conditions followed 
by a slow acclimatization to the greenhouse environment. Transformed sectors of T0 
plants (parental generation) maturing in the greenhouse are identified by NPTII 
ELISA and/or by the fungicidal activity analysis of leaf extracts while transgenic 
seeds harvested from NPTII-positive T0 plants are identified by fungicidal activity 
analysis of small portions of dry seed cotyledon. 

An alternative sunflower transformation protocol allows the recovery of 
transgenic progeny without the use of chemical selection pressure. This method is 
generally used in cases where the nucleotide sequences of the present invention are 
operably linked to constitutive or inducible promoters. Seeds are dehulled and 
surface-sterilized for 20 minutes in a 20% Clorox bleach solution with the addition of 
two to three drops of Tween 20 per 100 ml of solution, then rinsed three times with 
distilled water. Sterilized seeds are imbibed in the dark at 26°C for 20 hours on filter 
paper moistened with water. The cotyledons and root radicle are removed, and the 
meristem explants are cultured on 374E (GBA medium consisting of MS salts, 
Shepard vitamins, 40 mg/l adenine sulfate, 3% sucrose, 0.5 mg/l 6-BAP, 0.25 mg/l 
IAA, 0.1 mg/l GA, and 0.8% Phytagar at pH 5.6) for 24 hours in the dark. The 
primary leaves are removed to expose the apical meristem, around 40 explants are 
placed with the apical dome facing upward in a 2 cm circle in the center of 374M 
(GBA medium with 1.2% Phytagar), and then cultured on the medium for 24 hours in 
the dark. 

Approximately 18.8 mg of 1.8 µm tungsten particles are resuspended in 150 µl 
absolute ethanol. After sonication, 8 µl of the suspension is dropped on the center of 
the macrocarrier surface. Each plate is bombarded twice with 650 psi rupture discs in 
the first shelf at 26 mm of Hg helium gun vacuum. 

The plasmid of interest is introduced into Agrobacterium tumefaciens strain 
EHA105 via freeze thawing as described previously. Bacteria are grown overnight 
at 28°C in liquid YEP medium (10 g/l yeast extract, 10 g/l Bactopeptone, 
and 5 g/l NaCl, pH 7.0) in the presence of 50 µg/l kanamycin, and the pellet is 
resuspended in an inoculation medium (12.5 mM 2-(N-morpholino)ethanesulfonic acid 
(MES), 1 g/l NH4Cl and 0.3 g/l MgSO4 at pH 5.7) to reach a final concentration of 4.0 
at OD600. Particle-bombarded explants are transferred to GBA medium (374E), and a 
droplet of bacteria suspension is placed directly onto the top of the meristem. The 
explants are co-cultivated on the medium for 4 days, after which the explants are 
transferred to 374C medium (GBA with 1% sucrose and no BAP, IAA, or GA3, and 
supplemented with 250 µg/ml cefotaxime). The plantlets are cultured on the medium 
for about two weeks under 16-hour day and 26°C incubation conditions. 

Explants (around 2 cm long) from two weeks of culture in 374C medium are 
screened for the expression of the nucleotide sequence of the invention or the 
presence of the encoded polypeptide of the invention by immunological methods or 
fungicidal activity, or the like. After positive explants are identified, those shoots that 
fail to exhibit fungicidal activity are discarded, and every positive explant is 
subdivided into nodal explants. One nodal explant contains at least one potential 
node. The nodal segments are cultured on GBA medium for three to four days to 
promote the formation of axillary buds from each node. Then they are transferred to 
374C medium and allowed to develop for an additional four weeks. Developing buds 
are separated and cultured for an additional four weeks on 374C medium. Pooled leaf 
samples from each newly recovered shoot are screened again by the appropriate 
protein activity assay. At this time, the positive shoots recovered from a single node 
will generally have been enriched in the transgenic sector detected in the initial assay 
prior to nodal culture. 

Recovered shoots positive for a fungicidal polypeptide of the invention are 
grafted to Pioneer hybrid 6440 in vitro-grown sunflower seedling rootstock. The 
rootstocks are prepared in the following manner. Seeds are dehulled and surface-sterilized 
for 20 minutes in a 20% Clorox bleach solution with the addition of two to 
three drops of Tween 20 per 100 ml of solution, and are rinsed three times with 
distilled water. The sterilized seeds are germinated on filter paper moistened with water 
for three days, then they are transferred into 48 medium (half-strength MS salt, 0.5% 
sucrose, 0.3% gelrite pH 5.0) and grown at 26°C in the dark for three days, then 
incubated at 16-hour-day culture conditions. The upper portion of each selected seedling 
is removed, a vertical slice is made in each hypocotyl, and a transformed shoot is 
inserted into a V-cut. The cut area is wrapped with parafilm. After one week of 
culture on the medium, grafted plants are transferred to soil. In the first two weeks, 
they are maintained under high humidity conditions to acclimatize to a greenhouse 
environment. 



Example 19. Preparation of Antibodies. 

Standard methods for the production of antibodies were used such as those 
described in Harlow and Lane (1988) Antibodies: A Laboratory Manual (Cold Spring 
Harbor, NY: Cold Spring Harbor Laboratory), incorporated herein in its entirety by 
reference. Specifically, antibodies for polypeptides of the invention were produced 
by injecting female New Zealand white rabbits (Bethyl Laboratory, Montgomery, 
Tex.) six times with 100 micrograms of denatured purified polypeptide. 

Animals were then bled at two week intervals. The antibodies were purified 
by affinity chromatography with Affigel 15 (BioRad)-immobilized antigen as 
described by Harlow and Lane (1988) Antibodies: A Laboratory Manual, Cold Spring 
Harbor, N.Y. The affinity column was prepared with purified polypeptide essentially 
as recommended by BioRad RTM. Immune detection of antigens on PVDF blots was 
carried out following the protocol of Meyer et al. (1988) J. Cell. Biol. 107:163, 
incorporated herein in its entirety by reference, using the ECL kit from Amersham 
(Arlington Heights, Ill.). 

Example 20. Construction of Fus1 Transformation Vector 
A synthetic version of the Fus1 gene corresponding to the mature Fus1 peptide 
was constructed with a codon-bias representative of Manduca sexta (SEQ ID NO: 120 
and SEQ ID NO: 122). The codon preference selected for Fus1 was derived from the 
Kazusa codon usage database (available from www.Kazusa.or.jp/codon/). The BAA 
signal sequence was added to Fus1 to facilitate export of Fus1 out of the cell and into the 
intercellular space (Rahmatullah RJ et al. (1989) Plant Mol. Biol. 12(1):119-121). 
The BAA-Fus1 amino acid sequence is set forth in SEQ ID NO: 121 and SEQ ID 
NO: 123. Strong constitutive promoters were chosen to express Fus1 in tissues 
susceptible to F. verticillioides. BAA-Fus1 (SEQ ID NO: 120) was subsequently 
subcloned into the corresponding sites of vectors containing either the maize ubiquitin 
promoter:ubi-intron or the maize h2B promoter:ubi-intron (US Patent Number 
6,177,611, herein incorporated by reference). BAA-Fus1 was placed behind the 
indicated promoter with a 3' sequence corresponding to the pinII terminator. This 
cassette is flanked by non-compatible restriction enzyme sites designed to 
directionally clone the cassette into a binary plasmid containing the selectable marker 
gene cassette 35S-PAT-35S. The restriction enzyme sites were used to subclone the 
promoter/intron:BAA-Fus1:pinII ter cassette into a binary plasmid for corn 
transformation. 

Example 21. Construction of Fus2 Transformation Vectors 
A synthetic version of Fus2 operably linked to a modified barley alpha 
amylase (BAA) signal peptide was constructed with a codon-bias representative of 
Streptomyces coelicolor (SEQ ID NO: 124 and SEQ ID NO: 126). S. coelicolor codon 
usage was chosen because of its overall similarity to the codon usage observed in 
plants. The codon preference selected for Fus2 was derived from the Kazusa codon 
usage database (available from www.Kazusa.or.jp/codon/). See also Tables 1 and 2. 
The BAA signal sequence was added to Fus2 to facilitate export of Fus2 out of the 
cell and into the intercellular space. Modifications to the 3' end of the signal peptide 
were made to achieve correct signal peptide cleavage as predicted by the SIGNALP 
(Version 1.1) program (Center for Biological Sequence Analysis, Technical 
University of Denmark). The BAA-Fus2 amino acid sequence is set forth in SEQ ID 
NO: 125 and SEQ ID NO: 127. The synthetic gene was constructed using a series of 
overlapping complementary oligonucleotides that were annealed together, Klenow 
treated to repair the gaps, and PCR amplified using primers corresponding to 5' and 
3' ends of the synthetic gene. Restriction enzyme sites were incorporated into the 
PCR primers to facilitate gene cloning. The PCR product was TOPO cloned into 
pCR2.1 (Invitrogen) and sequence verified. A restriction enzyme fragment containing 
BAA-Fus2 was subsequently subcloned into the corresponding sites of vectors 
containing either the maize ubiquitin promoter:ubi-intron or the maize h2B 
promoter:ubi-intron. The vectors contained a 3' sequence corresponding to the pinII 
terminator. The BAA-Fus2 fragment was cloned between the indicated promoter and 
the pinII terminator. Strong constitutive promoters were chosen to express Fus2 in 
tissues susceptible to F. verticillioides. The promoter/intron:BAA-Fus2:pinII ter 
cassette is flanked by non-compatible restriction enzyme sites designed to 
directionally clone the cassette into a binary plasmid containing a selectable marker. 
The restriction enzyme sites were used to subclone the promoter/intron:BAA-Fus2:pinII 
ter cassette into a binary plasmid for corn transformation. 
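
One simple way to apply a codon usage table when designing a synthetic gene is to pick, for each amino acid, the codon with the highest listed frequency. The Python sketch below illustrates that strategy using per-thousand frequencies copied from Table 1 for four amino acids. It is not presented as the procedure actually used for Fus1 or Fus2, which the text does not detail; the peptide "MKLA" and the function name are hypothetical.

```python
# Per-thousand codon frequencies from Table 1 (S. coelicolor A3(2)),
# restricted to the four amino acids used in this toy example.
USAGE = {
    "M": {"AUG": 15.8},
    "K": {"AAA": 1.0, "AAG": 19.7},
    "L": {"UUA": 0.1, "UUG": 2.4, "CUU": 1.5,
          "CUC": 36.6, "CUA": 0.3, "CUG": 61.3},
    "A": {"GCU": 2.9, "GCC": 78.6, "GCA": 5.3, "GCG": 49.8},
}


def back_translate(peptide):
    """Return a DNA coding sequence using the most frequent codon per residue."""
    codons = []
    for residue in peptide:
        table = USAGE[residue]                 # KeyError for unsupported residues
        best = max(table, key=table.get)       # codon with the highest frequency
        codons.append(best.replace("U", "T"))  # report as DNA rather than RNA
    return "".join(codons)


print(back_translate("MKLA"))  # hypothetical four-residue peptide
```

A real design would also need to cover all twenty amino acids and stop codons, and might weigh restriction sites, repeats, or secondary structure, none of which this sketch attempts.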

Table 1. Streptomyces coelicolor A3(2) [gbbct]: 6257 CDS's (2043281 codons) 
fields: [triplet] [frequency: per thousand] ([number]) 

UUU  0.4 (   863)  UCU  0.6 (  1266)  UAU  1.0 (  1962)  UGU  0.7 (  1448)
UUC 26.0 ( 53065)  UCC 20.2 ( 41262)  UAC 19.5 ( 39789)  UGC  7.0 ( 14341)
UUA  0.1 (   128)  UCA  1.0 (  2137)  UAA  0.1 (   290)  UGA  2.4 (  4878)
UUG  2.4 (  4935)  UCG 13.8 ( 28229)  UAG  0.5 (  1089)  UGG 15.1 ( 30770)

CUU  1.5 (  3129)  CCU  1.5 (  2995)  CAU  1.6 (  3366)  CGU  5.5 ( 11183)
CUC 36.6 ( 74736)  CCC 25.4 ( 51951)  CAC 21.5 ( 44018)  CGC 39.1 ( 79956)
CUA  0.3 (   657)  CCA  1.3 (  2633)  CAA  1.3 (  2593)  CGA  2.5 (  5124)
CUG 61.3 (125241)  CCG 33.6 ( 68652)  CAG 25.1 ( 51248)  CGG 32.0 ( 65332)

AUU  0.6 (  1228)  ACU  1.1 (  2347)  AAU  0.7 (  1436)  AGU  1.5 (  3030)
AUC 27.6 ( 56340)  ACC 39.6 ( 80826)  AAC 16.2 ( 33191)  AGC 12.3 ( 25187)
AUA  0.7 (  1367)  ACA  1.6 (  3194)  AAA  1.0 (  2041)  AGA  0.8 (  1574)
AUG 15.8 ( 32271)  ACG 18.9 ( 38697)  AAG 19.7 ( 40293)  AGG  3.7 (  7488)

GUU  1.4 (  2905)  GCU  2.9 (  5908)  GAU  2.9 (  6024)  GGU  9.3 ( 18920)
GUC 47.2 ( 96460)  GCC 78.6 (160548)  GAC 58.0 (118595)  GGC 61.4 (125467)
GUA  2.7 (  5416)  GCA  5.3 ( 10890)  GAA  8.5 ( 17445)  GGA  7.1 ( 14608)
GUG 35.3 ( 72144)  GCG 49.8 (101831)  GAG 48.5 ( 99056)  GGG 18.2 ( 37288)

Coding GC 72.38% 1st letter GC 72.74% 2nd letter GC 51.39% 3rd letter GC 93.00% 
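
The summary line of Table 1 can be cross-checked against the codon counts. The Python sketch below parses the table in the triplet, frequency, count layout, sums the counts, and recomputes the third-position GC content. The parsing approach and variable names are illustrative and are not part of the specification.

```python
import re

TABLE_1 = """
UUU  0.4 (   863)  UCU  0.6 (  1266)  UAU  1.0 (  1962)  UGU  0.7 (  1448)
UUC 26.0 ( 53065)  UCC 20.2 ( 41262)  UAC 19.5 ( 39789)  UGC  7.0 ( 14341)
UUA  0.1 (   128)  UCA  1.0 (  2137)  UAA  0.1 (   290)  UGA  2.4 (  4878)
UUG  2.4 (  4935)  UCG 13.8 ( 28229)  UAG  0.5 (  1089)  UGG 15.1 ( 30770)
CUU  1.5 (  3129)  CCU  1.5 (  2995)  CAU  1.6 (  3366)  CGU  5.5 ( 11183)
CUC 36.6 ( 74736)  CCC 25.4 ( 51951)  CAC 21.5 ( 44018)  CGC 39.1 ( 79956)
CUA  0.3 (   657)  CCA  1.3 (  2633)  CAA  1.3 (  2593)  CGA  2.5 (  5124)
CUG 61.3 (125241)  CCG 33.6 ( 68652)  CAG 25.1 ( 51248)  CGG 32.0 ( 65332)
AUU  0.6 (  1228)  ACU  1.1 (  2347)  AAU  0.7 (  1436)  AGU  1.5 (  3030)
AUC 27.6 ( 56340)  ACC 39.6 ( 80826)  AAC 16.2 ( 33191)  AGC 12.3 ( 25187)
AUA  0.7 (  1367)  ACA  1.6 (  3194)  AAA  1.0 (  2041)  AGA  0.8 (  1574)
AUG 15.8 ( 32271)  ACG 18.9 ( 38697)  AAG 19.7 ( 40293)  AGG  3.7 (  7488)
GUU  1.4 (  2905)  GCU  2.9 (  5908)  GAU  2.9 (  6024)  GGU  9.3 ( 18920)
GUC 47.2 ( 96460)  GCC 78.6 (160548)  GAC 58.0 (118595)  GGC 61.4 (125467)
GUA  2.7 (  5416)  GCA  5.3 ( 10890)  GAA  8.5 ( 17445)  GGA  7.1 ( 14608)
GUG 35.3 ( 72144)  GCG 49.8 (101831)  GAG 48.5 ( 99056)  GGG 18.2 ( 37288)
"""

# Each entry is: triplet, per-thousand frequency, (count)
entries = re.findall(r"([ACGU]{3})\s+([\d.]+)\s*\(\s*(\d+)\)", TABLE_1)
counts = {codon: int(number) for codon, _, number in entries}

total = sum(counts.values())
gc3 = 100.0 * sum(n for codon, n in counts.items() if codon[2] in "GC") / total

print(f"codons parsed: {len(counts)}, total count: {total}")
print(f"3rd letter GC: {gc3:.2f}%")
print(f"GCC per thousand: {1000.0 * counts['GCC'] / total:.1f}")
```

The summed counts reproduce the 2043281 codons given in the table heading, which is a useful check that no entry has been misread.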
Table 2. Streptomyces coelicolor [gbbct]: 2110 CDS's (646333 codons)

fields: [triplet] [frequency: per thousand] ([number])

UUU  0.5 (   329)  UCU  0.8 (   496)  UAU  1.0 (   676)  UGU  0.8 (   517)
UUC 25.7 ( 16596)  UCC 20.1 ( 12971)  UAC 19.4 ( 12521)  UGC  7.3 (  4734)
UUA  0.1 (    49)  UCA  1.2 (   797)  UAA  0.2 (   105)  UGA  2.6 (  1650)
UUG  2.6 (  1696)  UCG 13.5 (  8729)  UAG  0.5 (   355)  UGG 15.2 (  9813)

CUU  1.9 (  1228)  CCU  1.8 (  1178)  CAU  1.9 (  1251)  CGU  5.6 (  3602)
CUC 36.2 ( 23411)  CCC 25.4 ( 16419)  CAC 22.6 ( 14594)  CGC 39.2 ( 25310)
CUA  0.5 (   304)  CCA  1.6 (  1018)  CAA  1.7 (  1076)  CGA  2.9 (  1885)
CUG 59.3 ( 38346)  CCG 32.7 ( 21145)  CAG 25.8 ( 16671)  CGG 31.5 ( 20333)

AUU  0.8 (   497)  ACU  1.4 (   925)  AAU  0.8 (   515)  AGU  1.6 (  1023)
AUC 27.8 ( 17997)  ACC 39.9 ( 25804)  AAC 16.2 ( 10447)  AGC 12.7 (  8194)
AUA  0.7 (   444)  ACA  1.9 (  1245)  AAA  1.3 (   829)  AGA  0.8 (   537)
AUG 16.1 ( 10392)  ACG 19.1 ( 12377)  AAG 19.8 ( 12795)  AGG  3.8 (  2441)

GUU  1.7 (  1086)  GCU  3.8 (  2429)  GAU  3.5 (  2251)  GGU  9.1 (  5867)
GUC 46.3 ( 29904)  GCC 77.5 ( 50098)  GAC 58.2 ( 37624)  GGC 58.8 ( 38034)
GUA  2.7 (  1767)  GCA  6.7 (  4302)  GAA  9.6 (  6215)  GGA  7.3 (  4689)
GUG 33.9 ( 21929)  GCG 48.6 ( 31399)  GAG 47.9 ( 30970)  GGG 17.8 ( 11502)



Coding GC 71.94% 1st letter GC 72.38% 2nd letter GC 51.28% 3rd letter GC 92.14% 
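
The per-thousand frequencies and the positional GC percentages reported with Table 2 follow directly from the raw codon counts. The Python sketch below is illustrative only: it assumes the counts are read from Table 2 in the usual codon-table order (each row fixes the first and third base, and the four entries in a row correspond to a second base of U, C, A, G). The names `ROWS`, `build_counts`, `per_thousand`, and `gc_percent` are hypothetical helpers, not part of the specification.

```python
# Codon counts transcribed from Table 2 (Streptomyces coelicolor).
# Assumed layout: row i fixes the first base (i // 4) and third base (i % 4);
# the four entries in each row are for second base U, C, A, G in that order.
BASES = "UCAG"

ROWS = [
    (329, 496, 676, 517),          # UUU UCU UAU UGU
    (16596, 12971, 12521, 4734),   # UUC UCC UAC UGC
    (49, 797, 105, 1650),          # UUA UCA UAA UGA
    (1696, 8729, 355, 9813),       # UUG UCG UAG UGG
    (1228, 1178, 1251, 3602),      # CUU CCU CAU CGU
    (23411, 16419, 14594, 25310),  # CUC CCC CAC CGC
    (304, 1018, 1076, 1885),       # CUA CCA CAA CGA
    (38346, 21145, 16671, 20333),  # CUG CCG CAG CGG
    (497, 925, 515, 1023),         # AUU ACU AAU AGU
    (17997, 25804, 10447, 8194),   # AUC ACC AAC AGC
    (444, 1245, 829, 537),         # AUA ACA AAA AGA
    (10392, 12377, 12795, 2441),   # AUG ACG AAG AGG
    (1086, 2429, 2251, 5867),      # GUU GCU GAU GGU
    (29904, 50098, 37624, 38034),  # GUC GCC GAC GGC
    (1767, 4302, 6215, 4689),      # GUA GCA GAA GGA
    (21929, 31399, 30970, 11502),  # GUG GCG GAG GGG
]


def build_counts(rows):
    """Map each RNA triplet to its observed count."""
    counts = {}
    for i, row in enumerate(rows):
        first, third = BASES[i // 4], BASES[i % 4]
        for j, n in enumerate(row):
            counts[first + BASES[j] + third] = n
    return counts


COUNTS = build_counts(ROWS)
TOTAL = sum(COUNTS.values())


def per_thousand(codon):
    """Frequency of a codon per thousand codons, as in the table's second field."""
    return 1000.0 * COUNTS[codon] / TOTAL


def gc_percent(position):
    """Percentage of codons with G or C at the given position (0, 1, or 2)."""
    gc = sum(n for codon, n in COUNTS.items() if codon[position] in "GC")
    return 100.0 * gc / TOTAL


if __name__ == "__main__":
    print("total codons:", TOTAL)
    print("AUG per thousand:", round(per_thousand("AUG"), 1))
    print("3rd letter GC %:", round(gc_percent(2), 2))
```

A table of this kind is typically consulted when choosing synonymous codons for expression in a given host; the strong third-position GC bias is what the summary line beneath the table quantifies.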



All publications and patent applications mentioned in the specification are
indicative of the level of those skilled in the art to which this invention pertains. All
publications and patent applications are herein incorporated by reference to the same
extent as if each individual publication or patent application was specifically and
individually indicated to be incorporated by reference.

Although the foregoing invention has been described in some detail by way of
illustration and example for purposes of clarity of understanding, it will be obvious
that certain changes and modifications may be practiced within the scope of the
appended claims.
THAT WHICH IS CLAIMED:
1. An isolated nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of:

(a) a nucleotide sequence set forth in SEQ ID NO:1, 3, 5, 9, 11, 13,
15, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84,
87, 90, 93, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, or 126;

(b) a nucleotide sequence that encodes a polypeptide set forth in
SEQ ID NO:2, 4, 6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40,
41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74,
76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111,
113, 115, 117, 119, 121, 123, 125, or 127;

(c) a nucleotide sequence encoding a polypeptide having at least
about 90% sequence identity to the amino acid sequence shown in SEQ ID NO:2, 4,
6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47,
49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82,
83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,
121, 123, 125, or 127, wherein said polypeptide possesses defensive activity;

(d) a nucleotide sequence that hybridizes under stringent
conditions to a nucleotide sequence having a sequence set forth in SEQ ID NO:1, 3, 5,
9, 11, 13, 15, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75,
78, 81, 84, 87, 90, 93, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122,
124, or 126, wherein said nucleotide sequence encodes a polypeptide possessing
defensive activity;

(e) a nucleotide sequence having 99% identity to a nucleotide
sequence that encodes an amino acid sequence set forth in SEQ ID NO:2, 4, 6, 10, 12,
14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52,
53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86,
88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123,
125, or 127, wherein said nucleotide sequence encodes a polypeptide possessing
defensive activity; and

(f) a nucleotide sequence consisting of a complement of any one of
the nucleotide sequences in (a), (b), (c), (d), or (e).
2. The nucleic acid molecule of claim 1 further comprising vector nucleic
acid sequences.

3. A host cell engineered to express the polypeptide encoded by any one
of the nucleic acid molecules of claim 1.

4. The host cell of claim 3 wherein the host cell is selected from the
group consisting of fungi, yeast, and plants.

5. A virus comprising an isolated nucleic acid molecule of claim 1.

6. An expression cassette comprising a nucleic acid molecule of claim 1,
wherein said nucleic acid is operably linked to a promoter that drives expression in a
plant cell.

7. The expression cassette of claim 6, wherein said promoter is selected
from the group consisting of constitutive, inducible, and tissue-preferred promoters.

8. The expression cassette of claim 7, wherein said promoter is a
pathogen-inducible promoter.

9. An isolated polypeptide comprising an amino acid sequence selected
from the group consisting of:

(a) an amino acid sequence set forth in SEQ ID NO:2, 4, 6, 10, 12,
14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52,
53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86,
88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123,
125, or 127;

(b) an amino acid sequence having at least about 90% sequence
identity to the amino acid sequence set forth in SEQ ID NO:2, 4, 6, 10, 12, 14, 16, 22,
23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56,
58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91,
92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127,
wherein said polypeptide possesses defensive activity; and

(c) an amino acid sequence comprising at least 35 contiguous
amino acids of the amino acid sequence set forth in SEQ ID NO:2, 4, 6, 10, 12, 14,
16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53,
55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88,
89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or
127, wherein said polypeptide possesses defensive activity.

10. A composition comprising the isolated polypeptide of claim 9.

11. A transformed plant comprising in its genome at least one stably
incorporated expression cassette comprising a nucleotide sequence operably linked to
a promoter that drives expression in said plant cell, wherein said nucleotide sequence
comprises a nucleotide sequence selected from the group consisting of:

(a) a nucleotide sequence set forth in SEQ ID NO:1, 3, 5, 9, 11, 13,
15, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84,
87, 90, 93, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, or 126;

(b) a nucleotide sequence that encodes a polypeptide set forth in
SEQ ID NO:2, 4, 6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40,
41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74,
76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111,
113, 115, 117, 119, 121, 123, 125, or 127;

(c) a nucleotide sequence encoding a polypeptide having at least
about 90% sequence identity to the amino acid sequence shown in SEQ ID NO:2, 4,
6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47,
49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82,
83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,
121, 123, 125, or 127, wherein said polypeptide possesses defensive activity; and

(d) a nucleotide sequence encoding a polypeptide comprising at
least 35 contiguous amino acids of SEQ ID NO:2, 4, 6, 10, 12, 14, 16, 22, 23, 25, 26,
28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61,
62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95,
101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or 127, wherein said
polypeptide possesses defensive activity.

12. The transformed plant of claim 11, wherein said promoter is selected
from the group consisting of constitutive, inducible, and tissue-preferred promoters.

13. The transformed plant of claim 12, wherein said promoter is a
pathogen-inducible promoter.

14. The transformed plant of claim 11, wherein said plant is selected from
the group consisting of rice, corn, alfalfa, sunflower, Brassica, soybean, cotton,
safflower, peanut, sorghum, wheat, millet, and tobacco.

15. Transformed seed of the plant of claim 11.



16. A method for enhancing plant disease resistance to fungal pathogens,
said method comprising:

(a) transforming a plant with at least one stably incorporated
expression cassette comprising a nucleotide sequence operably linked to a promoter
that drives expression in a cell of said plant, wherein said nucleotide sequence
comprises a nucleotide sequence selected from the group consisting of:

(i) a nucleotide sequence set forth in SEQ ID NO:1, 3, 5,
9, 11, 13, 15, 21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75,
78, 81, 84, 87, 90, 93, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122,
124, or 126;

(ii) a nucleotide sequence that encodes a polypeptide set
forth in SEQ ID NO:2, 4, 6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37,
38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71,
73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109,
111, 113, 115, 117, 119, 121, 123, 125, or 127;

(iii) a nucleotide sequence encoding a polypeptide having at
least about 90% sequence identity to the amino acid sequence shown in SEQ ID
NO:2, 4, 6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44,

WO 02/086072 PCT/US02/12511

46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79,
80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, or 127, wherein said polypeptide possesses defensive activity;

(iv) a nucleotide sequence encoding a polypeptide
comprising at least 35 contiguous amino acids of SEQ ID NO:2, 4, 6, 10, 12, 14, 16,
22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44, 46, 47, 49, 50, 52, 53, 55,
56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79, 80, 82, 83, 85, 86, 88, 89,
91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, or
127, wherein said polypeptide possesses defensive activity; and

(b) determining the level of increased resistance to said fungal
pathogen in said plant.

17. The method of claim 16, wherein said plant is selected from the group
consisting of rice, corn, alfalfa, sunflower, Brassica, soybean, cotton, safflower, peanut,
sorghum, wheat, millet, and tobacco.

18. The method of claim 16, wherein said plant possesses enhanced
resistance to Magnaporthe grisea, Rhizoctonia solani, or Fusarium verticillioides.

19. An antibody that selectively binds to an isolated polypeptide
comprising an amino acid sequence comprising at least 35 contiguous amino acid
residues of an amino acid sequence selected from the group consisting of SEQ ID
NO:2, 4, 6, 10, 12, 14, 16, 22, 23, 25, 26, 28, 29, 31, 32, 34, 35, 37, 38, 40, 41, 43, 44,
46, 47, 49, 50, 52, 53, 55, 56, 58, 59, 61, 62, 64, 65, 67, 68, 70, 71, 73, 74, 76, 77, 79,
80, 82, 83, 85, 86, 88, 89, 91, 92, 94, 95, 101, 103, 105, 107, 109, 111, 113, 115, 117,
119, 121, 123, 125, or 127.

20. A method for isolating plant disease resistance-conferring polypeptides
in an insect, said method comprising:

(a) injecting said insect with a suspension of a plant fungal
pathogen;

(b) collecting said insect hemolymph; and

(c) isolating said plant disease resistance-conferring polypeptides
contained in said insect hemolymph using liquid chromatography and mass
spectrophotometry.

21. An isolated nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of:

(a) a nucleotide sequence comprising at least 25 contiguous
nucleotides of the nucleotide sequence set forth in SEQ ID NO:1, 3, 5, 9, 11, 13, 15,
21, 24, 27, 30, 33, 36, 39, 42, 45, 48, 51, 54, 57, 60, 63, 66, 69, 72, 75, 78, 81, 84, 87,
90, 93, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, or 126; and

(b) a nucleotide sequence consisting of a complement of any one
of the nucleotide sequences in (a).

22. An isolated nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of:

(a) a nucleotide sequence set forth in nucleotides from 169 to 298 of
SEQ ID NO:11;

(b) a nucleotide sequence set forth in nucleotides from 58 to 624 of
SEQ ID NO:3;

(c) a nucleotide sequence set forth in nucleotides from 86 to 208 of
SEQ ID NO:15;

(d) a nucleotide sequence set forth in nucleotides from 46 to 216 of
SEQ ID NO:13; and

(e) a nucleotide sequence having 90% identity to a nucleotide
sequence in (a), (b), (c), or (d), wherein said nucleotide sequence encodes a
polypeptide having defensive activity.

23. An isolated polypeptide comprising an amino acid sequence selected
from the group consisting of:

(a) an amino acid sequence encoded by the nucleotide sequence set
forth in nucleotides from 169 to 298 of SEQ ID NO:11;

(b) an amino acid sequence encoded by the nucleotide sequence set
forth in nucleotides from 58 to 624 of SEQ ID NO:3;

(c) an amino acid sequence encoded by the nucleotide sequence set
forth in nucleotides from 86 to 208 of SEQ ID NO:15;

(d) an amino acid sequence encoded by the nucleotide sequence set
forth in nucleotides from 46 to 216 of SEQ ID NO:13; and

(e) an amino acid sequence having at least about 90% sequence
identity to the amino acid sequence set forth in (a), (b), (c), or (d), wherein said
polypeptide possesses defensive activity.
Peptide sequences from Lys-C digested Mag1

maglysc18:

1   5    10
VGASLGAAHTDF

maglysc24:

1   5    10   15
NNIFSAIGGADFNANHK

maglysc29:

1   5    10
KFDTPFMRSGWE

maglysc36:

1   5    10
LNLFHNNNHDLT

FIG. 3
SEQUENCE LISTING

<110> Altier, Daniel J.
Herrmann, Rafael
Lu, Albert L.
McCutchen, Billy F.
Presnail, James K.
Weaver, Janine L.
Wong, James F. H.

<120> Antimicrobial Polypeptides and Their 
Uses 

<130> 35718/244486 

<150> 60/285,355 
<151> 2001-04-20 

<160> 127 

<170> FastSEQ for Windows Version 4.6 

<210> 1 
<211> 766 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) ... (621) 

<400> 1 

atg ttc acc aaa ttc gtc gtc ctg gtc tgt ctt ctc gtt ggt gct aag    48
Met Phe Thr Lys Phe Val Val Leu Val Cys Leu Leu Val Gly Ala Lys
1               5                   10                  15

gct cgg cct cag ctc ggc gct ctc act ttc aat tct gat ggc act tcc    96
Ala Arg Pro Gln Leu Gly Ala Leu Thr Phe Asn Ser Asp Gly Thr Ser
            20                  25                  30

ggg gcg gcc gtc aaa gtt cca ttt ggt ggc aac aag aat aat ata ttt   144
Gly Ala Ala Val Lys Val Pro Phe Gly Gly Asn Lys Asn Asn Ile Phe
        35                  40                  45

agt gct atc ggt ggg gct gat ttt aac gct aat cac aaa ctg agt tct   192
Ser Ala Ile Gly Gly Ala Asp Phe Asn Ala Asn His Lys Leu Ser Ser
    50                  55                  60

gcg act gct gga gta gcg ctt gat aat atc cga ggt cac gga ctc agt   240
Ala Thr Ala Gly Val Ala Leu Asp Asn Ile Arg Gly His Gly Leu Ser
65                  70                  75                  80

ttg acg gat acc cac atc ccc ggc ttt gga gac aag ttg acg gcg gcc   288
Leu Thr Asp Thr His Ile Pro Gly Phe Gly Asp Lys Leu Thr Ala Ala
                85                  90                  95

ggc aag ttg aac ctc ttc cac aac aac aac cac gat ctg acc gcc aac   336
Gly Lys Leu Asn Leu Phe His Asn Asn Asn His Asp Leu Thr Ala Asn
            100                 105                 110

gct ttc gcc acc agg aac atg ccg aac att cct cag gtt cca aac ttc   384
Ala Phe Ala Thr Arg Asn Met Pro Asn Ile Pro Gln Val Pro Asn Phe
        115                 120                 125

aac acc gtt ggt ggc gga ctg gac tac atg ttc aag aac aag gtg ggc   432
Asn Thr Val Gly Gly Gly Leu Asp Tyr Met Phe Lys Asn Lys Val Gly
    130                 135                 140

gct tca tta ggc gcc gcg cac act gac ttt atc aac cgc aac gac tac   480
Ala Ser Leu Gly Ala Ala His Thr Asp Phe Ile Asn Arg Asn Asp Tyr
145                 150                 155                 160

tct gtg ggc ggc aag ttg aac ctg ttc cgg aac ccg agc acc tcg ctc   528
Ser Val Gly Gly Lys Leu Asn Leu Phe Arg Asn Pro Ser Thr Ser Leu
                165                 170                 175



I C » a ° 9 ?° "° fctt aa9 aag ttc gac ac 9 ccc tte atg aga tec 576 
Asp Phe Asn Ala Gly Phe Lys Lys Phe Asp Thr Pro Phe Met Arg Ser 
180 185 i 90 

~" 2f a CCC aaC atg ggc Ctc tcc cfc c tec aag ttc ttc caa 621 
Gly Trp Glu Pro Asn Met Gly Phe Ser Leu Ser Lys Phe Phe * 
195 200 205 

ata «^tctca 9tattatgaa ttgtcttttt ttattaatgt aatccgcctt 681 
taaatatttt tatataaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 741 
aaaaaaaaaa aaaaaaaaaa aaaaa Jlz, 

<210> 2 
<211> 206 
<212> PRT 

<213> Manduca sexta 



<400> 2 



Met Phe Thr Lys Phe Val Val Leu Val Cys Leu Leu Val Gly Ala Lys
1               5                   10                  15
Ala Arg Pro Gln Leu Gly Ala Leu Thr Phe Asn Ser Asp Gly Thr Ser
            20                  25                  30
Gly Ala Ala Val Lys Val Pro Phe Gly Gly Asn Lys Asn Asn Ile Phe
        35                  40                  45
Ser Ala Ile Gly Gly Ala Asp Phe Asn Ala Asn His Lys Leu Ser Ser
    50                  55                  60
Ala Thr Ala Gly Val Ala Leu Asp Asn Ile Arg Gly His Gly Leu Ser
65                  70                  75                  80
Leu Thr Asp Thr His Ile Pro Gly Phe Gly Asp Lys Leu Thr Ala Ala
                85                  90                  95
Gly Lys Leu Asn Leu Phe His Asn Asn Asn His Asp Leu Thr Ala Asn
            100                 105                 110
Ala Phe Ala Thr Arg Asn Met Pro Asn Ile Pro Gln Val Pro Asn Phe
        115                 120                 125
Asn Thr Val Gly Gly Gly Leu Asp Tyr Met Phe Lys Asn Lys Val Gly
    130                 135                 140
Ala Ser Leu Gly Ala Ala His Thr Asp Phe Ile Asn Arg Asn Asp Tyr
145                 150                 155                 160
Ser Val Gly Gly Lys Leu Asn Leu Phe Arg Asn Pro Ser Thr Ser Leu
                165                 170                 175
Asp Phe Asn Ala Gly Phe Lys Lys Phe Asp Thr Pro Phe Met Arg Ser
            180                 185                 190
Gly Trp Glu Pro Asn Met Gly Phe Ser Leu Ser Lys Phe Phe
        195                 200                 205



<210> 3 
<211> 760 
<212> DNA 

<213> Manduca sexta 
<220>
<221> CDS
<222> (34)...(624)

<221> misc_feature
<222> (0)...(0)
<223> n = A, T, C or G

<400> 3 

attcggcacg aggacgatgt ggtgttcgta cca atg gtg gta tca agg gta cgg    54
                                     Met Val Val Ser Arg Val Arg
                                     1               5

cgc gac aca cac ggc tcg gtc acc gtc aac tcg gac ggc acc tcc gga   102
Arg Asp Thr His Gly Ser Val Thr Val Asn Ser Asp Gly Thr Ser Gly
        10                  15                  20

gcg atc gtc aag gtg ccg ttc gca ggc gac gac aag aac atc gtc agc   150
Ala Ile Val Lys Val Pro Phe Ala Gly Asp Asp Lys Asn Ile Val Ser
    25                  30                  35

gcc atc ggt ggc ctc gac ctc gac aag aac ctc aag atg agc ggc gcc   198
Ala Ile Gly Gly Leu Asp Leu Asp Lys Asn Leu Lys Met Ser Gly Ala
40                  45                  50                  55

aca gcg ggc ttg gct tac gac aac gtc aat gga cac ggc gct act ctt   246
Thr Ala Gly Leu Ala Tyr Asp Asn Val Asn Gly His Gly Ala Thr Leu
                60                  65                  70

aca aac aca cat ata ccc agc ttc ggt gac aag ctg acg gca gcc ggc   294
Thr Asn Thr His Ile Pro Ser Phe Gly Asp Lys Leu Thr Ala Ala Gly
            75                  80                  85

aag ttg aac gtg ttc cat aac gac aac cac aac ctg gac gtg aag gcg   342
Lys Leu Asn Val Phe His Asn Asp Asn His Asn Leu Asp Val Lys Ala
        90                  95                  100

ttg gcc acc agg acc atg ccg gat att ccg cgc gtg ccc gac ttc aac   390
Leu Ala Thr Arg Thr Met Pro Asp Ile Pro Arg Val Pro Asp Phe Asn
    105                 110                 115

acc tac ggc ggc ggc gtc gac tac atg ttc aag gac aag gtg ggc gcg   438
Thr Tyr Gly Gly Gly Val Asp Tyr Met Phe Lys Asp Lys Val Gly Ala
120                 125                 130                 135

tcg gcg agc gct gcg cac acg cct ctc ttc gat cgc aac gac tac tcc   486
Ser Ala Ser Ala Ala His Thr Pro Leu Phe Asp Arg Asn Asp Tyr Ser
                140                 145                 150

gtg ggc ggc aag ctg aac ctg ttc cgt gac aag acc acc tcg ctc gac   534
Val Gly Gly Lys Leu Asn Leu Phe Arg Asp Lys Thr Thr Ser Leu Asp
            155                 160                 165

ttc aac gcc gac tac aag aag ttc gag atg ccc aac ttc aag tcc gac   582
Phe Asn Ala Asp Tyr Lys Lys Phe Glu Met Pro Asn Phe Lys Ser Asp
        170                 175                 180

tgg aca ccc aac atc ggc ttc tca ttc agc aag ttt tgg tag           624
Trp Thr Pro Asn Ile Gly Phe Ser Phe Ser Lys Phe Trp *
    185                 190                 195

tttattatta tgattcaagt catccacgtt ttgtacgggt gtaattaatt acgattttaa   684
agtttaagta tttatattta aataaatatt ttggaaatna aaaaaaaaaa aaaaaaaaaa   744
aaaaaaaaaa ctcgag                                                   760

<210> 4 
<211> 196 
<212> PRT 

<213> Manduca sexta 

<400> 4

Met Val Val Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val
1               5                   10                  15
Asn Ser Asp Gly Thr Ser Gly Ala Ile Val Lys Val Pro Phe Ala Gly
            20                  25                  30
Asp Asp Lys Asn Ile Val Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys
        35                  40                  45
Asn Leu Lys Met Ser Gly Ala Thr Ala Gly Leu Ala Tyr Asp Asn Val
    50                  55                  60
Asn Gly His Gly Ala Thr Leu Thr Asn Thr His Ile Pro Ser Phe Gly
65                  70                  75                  80
Asp Lys Leu Thr Ala Ala Gly Lys Leu Asn Val Phe His Asn Asp Asn
                85                  90                  95
His Asn Leu Asp Val Lys Ala Leu Ala Thr Arg Thr Met Pro Asp Ile
            100                 105                 110
Pro Arg Val Pro Asp Phe Asn Thr Tyr Gly Gly Gly Val Asp Tyr Met
        115                 120                 125
Phe Lys Asp Lys Val Gly Ala Ser Ala Ser Ala Ala His Thr Pro Leu
    130                 135                 140
Phe Asp Arg Asn Asp Tyr Ser Val Gly Gly Lys Leu Asn Leu Phe Arg
145                 150                 155                 160
Asp Lys Thr Thr Ser Leu Asp Phe Asn Ala Asp Tyr Lys Lys Phe Glu
                165                 170                 175
Met Pro Asn Phe Lys Ser Asp Trp Thr Pro Asn Ile Gly Phe Ser Phe
            180                 185                 190
Ser Lys Phe Trp
        195



<210> 5 
<211> 246 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (240) 

<221> misc_feature 

<222> 234, 242, 243, 244, 246 

<223> n = A,T,C or G 

<400> 5 

atg tcc ctg tcg tgt ctc ttc ctc gtt gcg ctg gcg ctg gtg ggc gca    48
Met Ser Leu Ser Cys Leu Phe Leu Val Ala Leu Ala Leu Val Gly Ala
1               5                   10                  15

gag agc aga tac atc gcc gac gat gtg gtg ttg gta ccg atg atg gta    96
Glu Ser Arg Tyr Ile Ala Asp Asp Val Val Leu Val Pro Met Met Val
            20                  25                  30

tca cgg gta agg cgc gac aca cac ggc tcg gtc acc gtc aac tcg gac   144
Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val Asn Ser Asp
        35                  40                  45

ggc acc tcc ggg agc gtc gtc aag gtg ccg ttc gca ggc gac gac aag   192
Gly Thr Ser Gly Ser Val Val Lys Val Pro Phe Ala Gly Asp Asp Lys
    50                  55                  60

aac gtc ttt agc gcc atc ggt ggt ctc gac ctc gat aag aan ctc aag   240
Asn Val Phe Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys Xaa Leu Lys
65                  70                  75                  80

annngn                                                            246

<210> 6
<211> 80
<212> PRT 

<213> Manduca sexta 
<220> 

<221> VARIANT
<222> 78

<223> Xaa = Any Amino Acid
<400> 6

Met Ser Leu Ser Cys Leu Phe Leu Val Ala Leu Ala Leu Val Gly Ala
1               5                   10                  15
Glu Ser Arg Tyr Ile Ala Asp Asp Val Val Leu Val Pro Met Met Val
            20                  25                  30
Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val Asn Ser Asp
        35                  40                  45
Gly Thr Ser Gly Ser Val Val Lys Val Pro Phe Ala Gly Asp Asp Lys
    50                  55                  60
Asn Val Phe Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys Xaa Leu Lys
65                  70                  75                  80

<210> 7 
<211> 336 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (336) 

<400> 7 

ggc acg agg tcc ctg tcg tgc ctc ttg tta ttt gcg ctg gcg ctg atg    48
Gly Thr Arg Ser Leu Ser Cys Leu Leu Leu Phe Ala Leu Ala Leu Met
1               5                   10                  15

ggc gcg gag agc aga ttc atc gcc gac gat gtg gtg ttc gta cca atg    96
Gly Ala Glu Ser Arg Phe Ile Ala Asp Asp Val Val Phe Val Pro Met
            20                  25                  30

gtg gta tca agg gta cgg cgc gac aca cac ggc tcg gtc acc gtc aac   144
Val Val Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val Asn
        35                  40                  45

tcg gac ggc acc tcc gga gcg atc gtc aag gtg ccg ttc gca ggc gac   192
Ser Asp Gly Thr Ser Gly Ala Ile Val Lys Val Pro Phe Ala Gly Asp
    50                  55                  60

gac aag aac atc gtc agc gcc atc ggt ggc ctc gac ctc gac aag aac   240
Asp Lys Asn Ile Val Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys Asn
65                  70                  75                  80

ctc aag atg agc ggc gcc aca gcg ggc ttg gct tac gac aac gtc aat   288
Leu Lys Met Ser Gly Ala Thr Ala Gly Leu Ala Tyr Asp Asn Val Asn
                85                  90                  95

gga cac ggc gct act ctt aca aac aca cat ata ccc aag ctt cgg tga   336
Gly His Gly Ala Thr Leu Thr Asn Thr His Ile Pro Lys Leu Arg *
            100                 105                 110



<210> 8 
<211> 111 
<212> PRT 

<213> Manduca sexta 

<400> 8

Gly Thr Arg Ser Leu Ser Cys Leu Leu Leu Phe Ala Leu Ala Leu Met
1               5                   10                  15
Gly Ala Glu Ser Arg Phe Ile Ala Asp Asp Val Val Phe Val Pro Met
            20                  25                  30
Val Val Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val Asn
        35                  40                  45
Ser Asp Gly Thr Ser Gly Ala Ile Val Lys Val Pro Phe Ala Gly Asp
    50                  55                  60
Asp Lys Asn Ile Val Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys Asn
65                  70                  75                  80
Leu Lys Met Ser Gly Ala Thr Ala Gly Leu Ala Tyr Asp Asn Val Asn
                85                  90                  95
Gly His Gly Ala Thr Leu Thr Asn Thr His Ile Pro Lys Leu Arg
            100                 105                 110



<210> 9
<211> 444
<212> DNA
<213> Manduca sexta

<220>
<221> CDS
<222> (1)...(444)

<221> misc_feature
<222> 123, 339, 421
<223> n = A,T,C or G

<400> 9 

atg tcc ctg tcg tgc ctc ttg tta ttt gcg ctg gcg ctg atg ggc gcc    48
Met Ser Leu Ser Cys Leu Leu Leu Phe Ala Leu Ala Leu Met Gly Ala
1               5                   10                  15

gag agc aga tac atc gct gac gat gtg gtg ttc gta ccg ata gtg gta    96
Glu Ser Arg Tyr Ile Ala Asp Asp Val Val Phe Val Pro Ile Val Val
            20                  25                  30

tca agg gta cgg cgt gac aca cac ggn tcg gtc acc gtc aac tcg gac   144
Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val Asn Ser Asp
        35                  40                  45

ggc acc tcc gga gcg atc gtc aag gtg ccg ttc gca ggc aac gac aag   192
Gly Thr Ser Gly Ala Ile Val Lys Val Pro Phe Ala Gly Asn Asp Lys
    50                  55                  60

aac atc gtc agc gcc atc ggc ggc ctc gac ctc gac aag aac ttc aag   240
Asn Ile Val Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys Asn Phe Lys
65                  70                  75                  80

atg agc ggc gcc aca gcg ggc ttg gca tac gac aac gtc aat aga cac   288
Met Ser Gly Ala Thr Ala Gly Leu Ala Tyr Asp Asn Val Asn Arg His
                85                  90                  95

ggg gct act ctt aca aac aca cat ata ccc agc ttc ggt gac aag ctg   336
Gly Ala Thr Leu Thr Asn Thr His Ile Pro Ser Phe Gly Asp Lys Leu
            100                 105                 110

acn gca acc ggc aag ttg aac gtg ttc caa aac gac aaa cac aac cct   384
Thr Ala Thr Gly Lys Leu Asn Val Phe Gln Asn Asp Lys His Asn Pro
        115                 120                 125

gga cgt gaa ggg gtt ggg cac caa gga cca tgc caa nta ttc cac gcg   432
Gly Arg Glu Gly Val Gly His Gln Gly Pro Cys Gln Xaa Phe His Ala
    130                 135                 140

tgg ccg act tca                                                   444
Trp Pro Thr Ser
145

<210> 10 
<211> 148 
<212> PRT 

<213> Manduca sexta 
<220> 

<221> VARIANT 
<222> 141 

<223> Xaa = Any Amino Acid 
<400> 10 



Met Ser Leu Ser Cys Leu Leu Leu Phe Ala Leu Ala Leu Met Gly Ala
1               5                   10                  15
Glu Ser Arg Tyr Ile Ala Asp Asp Val Val Phe Val Pro Ile Val Val
            20                  25                  30
Ser Arg Val Arg Arg Asp Thr His Gly Ser Val Thr Val Asn Ser Asp
        35                  40                  45
Gly Thr Ser Gly Ala Ile Val Lys Val Pro Phe Ala Gly Asn Asp Lys
    50                  55                  60
Asn Ile Val Ser Ala Ile Gly Gly Leu Asp Leu Asp Lys Asn Phe Lys
65                  70                  75                  80
Met Ser Gly Ala Thr Ala Gly Leu Ala Tyr Asp Asn Val Asn Arg His
                85                  90                  95
Gly Ala Thr Leu Thr Asn Thr His Ile Pro Ser Phe Gly Asp Lys Leu
            100                 105                 110
Thr Ala Thr Gly Lys Leu Asn Val Phe Gln Asn Asp Lys His Asn Pro
        115                 120                 125
Gly Arg Glu Gly Val Gly His Gln Gly Pro Cys Gln Xaa Phe His Ala
    130                 135                 140
Trp Pro Thr Ser
145



<210> 11 
<211> 617 
<212> DNA 

<213> Manduca sexta 

<220> 
<221> CDS 

<222> (28) . . . (456) 
<400> 11 

gaattcggca cgaggctacg ggctaca atg tct aag ttt ata tcc ata ctt tgt    54
                              Met Ser Lys Phe Ile Ser Ile Leu Cys
                              1               5

gtt gtc gcc tta ctg cta ata gca gaa act tat tgt tta aca agt ggt   102
Val Val Ala Leu Leu Leu Ile Ala Glu Thr Tyr Cys Leu Thr Ser Gly
10                  15                  20                  25

gtt cgc atc ata caa ccc act tat agg cct cca ccc agg aga cct gtt   150
Val Arg Ile Ile Gln Pro Thr Tyr Arg Pro Pro Pro Arg Arg Pro Val
                30                  35                  40

att tac aga gct gca cgc gac gct gga gat gaa ccc ttg tgg ctg tac   198
Ile Tyr Arg Ala Ala Arg Asp Ala Gly Asp Glu Pro Leu Trp Leu Tyr
            45                  50                  55

caa gga gac gac cac cct cga gcc cct tca agc ggc gac cat cct gta   246
Gln Gly Asp Asp His Pro Arg Ala Pro Ser Ser Gly Asp His Pro Val
        60                  65                  70

ctg ccc tcg atc ata gac gat gtg aag ctg gac ccc aac agg cgg tat   294
Leu Pro Ser Ile Ile Asp Asp Val Lys Leu Asp Pro Asn Arg Arg Tyr
    75                  80                  85

gcg cgt agt gta agc gag cct tcg tca cag gag cat cat gac cgc ttt   342
Ala Arg Ser Val Ser Glu Pro Ser Ser Gln Glu His His Asp Arg Phe
90                  95                  100                 105

gcg agg agc ttc gac tcc cgc agc agc aag cat cac ggc ggc agt cac   390
Ala Arg Ser Phe Asp Ser Arg Ser Ser Lys His His Gly Gly Ser His
                110                 115                 120

tcc acg tcc ggc ggc agc cgc gac act gga gct act cac ccg gga tac   438
Ser Thr Ser Gly Gly Ser Arg Asp Thr Gly Ala Thr His Pro Gly Tyr
            125                 130                 135

aat cgt cgt aac tca taa tttctcttca gtttctaaat atttttgttt          486
Asn Arg Arg Asn Ser *
        140

ctgctactaa ttttttctca tcaatattct tgtttgcttt caaatctttc attttatgat   546
aataatatgt afcactgatca ttatattgaa ataaatgatt aaattgaaaa aaaaaaaaaa   606
aaaaactcga g                                                        617



<210> 12 
<211> 142 
<212> PRT 

<213> Manduca sexta 
<400> 12 

Met Ser Lys Phe Ile Ser Ile Leu Cys Val Val Ala Leu Leu Leu Ile
1               5                   10                  15
Ala Glu Thr Tyr Cys Leu Thr Ser Gly Val Arg Ile Ile Gln Pro Thr
            20                  25                  30
Tyr Arg Pro Pro Pro Arg Arg Pro Val Ile Tyr Arg Ala Ala Arg Asp
        35                  40                  45
Ala Gly Asp Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro Arg
    50                  55                  60
Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp
65                  70                  75                  80
Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro
                85                  90                  95
Ser Ser Gln Glu His His Asp Arg Phe Ala Arg Ser Phe Asp Ser Arg
            100                 105                 110
Ser Ser Lys His His Gly Gly Ser His Ser Thr Ser Gly Gly Ser Arg
        115                 120                 125
Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser
    130                 135                 140

<210> 13 
<211> 370 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (216) 

<400> 13 

cta tac tgt ctt ttg ttt ttg tgt ttc att act ttc tcc atg agt gaa    48
Leu Tyr Cys Leu Leu Phe Leu Cys Phe Ile Thr Phe Ser Met Ser Glu
1               5                   10                  15

gat ccg aga tgt tct cag ccg att gca tct ggt gtg tgc ttt gga aat    96
Asp Pro Arg Cys Ser Gln Pro Ile Ala Ser Gly Val Cys Phe Gly Asn
            20                  25                  30

att gaa aaa ttc gga tac gac atc gac gag cac aaa tgt gta cag ttc   144
Ile Glu Lys Phe Gly Tyr Asp Ile Asp Glu His Lys Cys Val Gln Phe
        35                  40                  45

gtg tac gga gga tgc ttt ggc aat gac aac caa ttc gac tcg ctt gaa   192
Val Tyr Gly Gly Cys Phe Gly Asn Asp Asn Gln Phe Asp Ser Leu Glu
    50                  55                  60

gaa tgt caa gca gtt tgt cct taa ccattccgat gtttataaat gacgtgtata   246
Glu Cys Gln Ala Val Cys Pro *
65                  70

taatgcaaga atgcattata gccaatcaat cgatttttaa tcgattcaga agccgttatc   306
gattatgaca ttgctgtgca attttctaaa tatttaattt agtgttattc atattcactt   366
tcaa                                                                370

<210> 14 
<211> 71 
<212> PRT 

<213> Manduca sexta 
<400> 14 

Leu Tyr Cys Leu Leu Phe Leu Cys Phe Ile Thr Phe Ser Met Ser Glu 
1 5 10 15 

Asp Pro Arg Cys Ser Gln Pro Ile Ala Ser Gly Val Cys Phe Gly Asn 
20 25 30 

Ile Glu Lys Phe Gly Tyr Asp Ile Asp Glu His Lys Cys Val Gln Phe 
35 40 45 

Val Tyr Gly Gly Cys Phe Gly Asn Asp Asn Gln Phe Asp Ser Leu Glu 
50 55 60 

Glu Cys Gln Ala Val Cys Pro 
65 70 



<210> 15 
<211> 948 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (23) . . . (208) 

<400> 15 

gaattcggca cgagggttga ca atg aaa agc caa ttg caa atc gta ttg ttg 52 
Met Lys Ser Gln Leu Gln Ile Val Leu Leu 
1 5 10 

ttg ctg acg gtg atg ttt gca ata act tat gcc ggt tac tac aca aca 100 
Leu Leu Thr Val Met Phe Ala Ile Thr Tyr Ala Gly Tyr Tyr Thr Thr 
15 20 25 

aca caa cgt cat ttt gca gta agc tgc agt caa gct tgt gaa tca gaa 148 
Thr Gln Arg His Phe Ala Val Ser Cys Ser Gln Ala Cys Glu Ser Glu 
30 35 40 

gga agc aac tgt gaa ttg gtt aga agc tat gta tgg act tgc tat tgt 196 
Gly Ser Asn Cys Glu Leu Val Arg Ser Tyr Val Trp Thr Cys Tyr Cys 
45 50 55 

tat tgt cca tga ttttggctat gtttccaaga acatagtttt attatatggt 248 
Tyr Cys Pro * 
60 

gtaacacgaa aggaaaataa ttattttact gaagaatatt tttacaagaa agaaataaga 308 
gacaagaaag aaaaaaaaac aagacagtta tattttgtaa gaaggggacc tcgtgcatca 368 
gaaaggaaat gtagttaatc atttaaagga ctgtatatgt tttaaatttt tctcacgaaa 428 
tgaatctgaa gtgatttttc tgacgactac gaaaattgtc gcggacataa tatatatttc 488 
tgacaaatcc taatttgcac aggaatattt gaaagtggta tttaagctta tgcactgcgc 548 
agtgtccttg tatataatca ttttactatt caagttgaat gaaacaattg aaatttgcat 608 
caaattgtgc tttgtaaatc tcttatggtc acatcttacg gctgcatcat gtgtcaaccg 668 
agagatattt tatcgtaata ttaagttcta cgctggtggt tatgttttaa ttgtttagtg 728 
tcatttacca agtacatctc taaatttcta gtttcagttt agatttttaa gcggaatatt 788 
ttaatctgta ataactacat atccttgaag gagtaggcag aggcgcaacg ctgcattccc 848 
ttttcgccgt gtgtattaca tcccatgata tgatgagggg cgagcctatc gccgtatcgg 908 
ggataaattc ccgattccgg gctgatactg agaagaaaaa 948 

<210> 16 
<211> 61 
<212> PRT 

<213> Manduca sexta 
<400> 16 

Met Lys Ser Gln Leu Gln Ile Val Leu Leu Leu Leu Thr Val Met Phe 
1 5 10 15 

Ala Ile Thr Tyr Ala Gly Tyr Tyr Thr Thr Thr Gln Arg His Phe Ala 
20 25 30 

Val Ser Cys Ser Gln Ala Cys Glu Ser Glu Gly Ser Asn Cys Glu Leu 
35 40 45 

Val Arg Ser Tyr Val Trp Thr Cys Tyr Cys Tyr Cys Pro 
50 55 60 



<210> 17 
<211> 254 
<212> PRT 

<213> Trichoplusia ni 
<400> 17 

Met Phe Thr Tyr Lys Leu Ile Leu Gly Leu Val Leu Val Val Ser Ala 
1 5 10 15 

Ser Ala Arg Tyr Leu Val Phe Glu Asp Leu Glu Gly Glu Ser Tyr Leu 
20 25 30 

Val Pro Asn Gln Ala Glu Asp Glu Gln Val Leu Glu Gly Glu Pro Phe 
35 40 45 

Tyr Glu Asn Ala Val Gln Leu Ala Ser Pro Arg Val Arg Arg Gln Ala 
50 55 60 

Gln Gly Ser Val Thr Leu Asn Ser Asp Gly Ser Met Gly Leu Gly Ala 
65 70 75 80 

Lys Val Pro Ile Val Gly Asn Glu Lys Asn Val Leu Ser Ala Leu Gly 
85 90 95 

Ser Val Asp Leu Asn Asp Gln Leu Lys Pro Ala Ser Arg Gly Met Gly 
100 105 110 

Leu Ala Leu Asp Asn Val Asn Gly His Gly Leu Ser Val Met Lys Glu 
115 120 125 

Thr Val Pro Gly Phe Gly Asp Arg Leu Thr Gly Ala Gly Arg Val Asn 
130 135 140 

Val Phe His Asn Asp Asn His Asp Ile Ser Ala Lys Ala Phe Val Thr 
145 150 155 160 

Lys Asn Met Pro Asp Phe Pro Asn Val Pro Asn Phe Asn Thr Val Gly 
165 170 175 

Gly Gly Val Asp Tyr Met Tyr Lys Asn Lys Val Gly Ala Ser Leu Gly 
180 185 190 

Met Ala Asn Thr Pro Phe Leu Asp Arg Lys Asp Tyr Ser Ala Met Gly 
195 200 205 

Asn Leu Asn Val Phe Arg Ser Pro Thr Thr Ser Val Asp Phe Asn Ala 
210 215 220 

Gly Phe Lys Lys Phe Asp Thr Pro Val Phe Lys Ser Asn Trp Glu Pro 
225 230 235 240 

Asn Phe Gly Leu Thr Phe Ser Arg Ser Phe Gly Asn Lys Trp 
245 250 

<210> 18 
<211> 233 
<212> PRT 

<213> Hyalophora cecropia 



<400> 18 



Met Phe Ala Lys Leu Phe Leu Val Ser Val Leu Leu Val Gly Val Asn 
1 5 10 15 

Ser Arg Tyr Val Leu Val Glu Glu Pro Gly Tyr Tyr Asp Lys Gln Tyr 
20 25 30 

Glu Glu Gln Pro Gln Gln Trp Val Asn Ser Arg Val Arg Arg Gln Ala 
35 40 45 

Gly Ala Leu Thr Ile Asn Ser Asp Gly Thr Ser Gly Ala Val Val Lys 
50 55 60 

Val Pro Ile Thr Gly Asn Glu Asn His Lys Phe Ser Ala Leu Gly Ser 
65 70 75 80 

Val Asp Leu Thr Asn Gln Met Lys Leu Gly Ala Ala Thr Ala Gly Leu 
85 90 95 

Ala Tyr Asp Asn Val Asn Gly His Gly Ala Thr Leu Thr Lys Thr His 
100 105 110 

Ile Pro Gly Phe Gly Asp Lys Met Thr Ala Ala Gly Lys Val Asn Leu 
115 120 125 

Phe His Asn Asp Asn His Asp Phe Ser Ala Lys Ala Phe Ala Thr Lys 
130 135 140 

Asn Met Pro Asn Ile Pro Gln Val Pro Asn Phe Asn Thr Val Gly Ala 
145 150 155 160 

Gly Val Asp Tyr Met Phe Lys Asp Lys Ile Gly Ala Ser Ala Asn Ala 
165 170 175 

Ala His Thr Asp Phe Ile Asn Arg Asn Asp Tyr Ser Leu Gly Gly Lys 
180 185 190 

Leu Asn Leu Phe Lys Thr Pro Thr Thr Ser Leu Asp Phe Asn Ala Gly 
195 200 205 

Trp Lys Lys Phe Asp Thr Pro Phe Phe Lys Ser Ser Trp Glu Pro Ser 
210 215 220 

Thr Ser Phe Ser Phe Ser Lys Tyr Phe 
225 230 

<210> 19 
<211> 235 
<212> PRT 

<213> Hyalophora cecropia 



<400> 19 



Met Phe Gly Lys Ile Val Phe Leu Leu Leu Val Ala Leu Cys Ala Gly 
1 5 10 15 

Val Gln Ser Arg Tyr Leu Ile Val Ser Glu Pro Val Tyr Tyr Ile Glu 
20 25 30 

His Tyr Glu Glu Pro Glu Leu Leu Ala Ser Ser Arg Val Arg Arg Asp 
35 40 45 

Ala His Gly Ala Leu Thr Leu Asn Ser Asp Gly Thr Ser Gly Ala Val 
50 55 60 

Val Lys Val Pro Phe Ala Gly Asn Asp Lys Asn Ile Val Ser Ala Ile 
65 70 75 80 

Gly Ser Val Asp Leu Thr Asp Arg Gln Lys Leu Gly Ala Ala Thr Ala 
85 90 95 

Gly Val Ala Leu Asp Asn Ile Asn Gly His Gly Leu Ser Leu Thr Asp 
100 105 110 

Thr His Ile Pro Gly Phe Gly Asp Lys Met Thr Ala Ala Gly Lys Val 
115 120 125 

Asn Val Phe His Asn Asp Asn His Asp Ile Thr Ala Lys Ala Phe Ala 
130 135 140 

Thr Arg Asn Met Pro Asp Ile Ala Asn Val Pro Asn Phe Asn Thr Val 
145 150 155 160 

Gly Gly Gly Ile Asp Tyr Met Phe Lys Asp Lys Ile Gly Ala Ser Ala 
165 170 175 

Ser Ala Ala His Thr Asp Phe Ile Asn Arg Asn Asp Tyr Ser Leu Asp 
180 185 190 

Gly Lys Leu Asn Leu Phe Lys Thr Pro Asp Thr Ser Ile Asp Phe Asn 
195 200 205 

Ala Gly Phe Lys Lys Phe Asp Thr Pro Phe Met Lys Ser Ser Trp Glu 
210 215 220 

Pro Asn Phe Gly Phe Ser Leu Ser Lys Tyr Phe 
225 230 235 

<210> 20 

<211> 214 

<212> PRT 

<213> Bombyx mori 

<400> 20 

Met Ser Lys Ser Val Ala Leu Leu Leu Leu Cys Ala Cys Leu Ala Ser 
1 5 10 15 

Gly Arg His Val Pro Thr Arg Ala Arg Arg Gln Ala Gly Ser Phe Thr 
20 25 30 

Val Asn Ser Asp Gly Thr Ser Gly Ala Ala Leu Lys Val Pro Leu Thr 
35 40 45 

Gly Asn Asp Lys Asn Val Leu Ser Ala Ile Gly Ser Ala Asp Phe Asn 
50 55 60 

Asp Arg His Lys Leu Ser Ala Ala Ser Ala Gly Leu Ala Leu Asp Asn 
65 70 75 80 

Val Asn Gly His Gly Leu Ser Leu Thr Gly Thr Arg Ile Pro Gly Phe 
85 90 95 

Gly Glu Gln Leu Gly Val Ala Gly Lys Val Asn Leu Phe His Asn Asn 
100 105 110 

Asn His Asp Leu Ser Ala Lys Ala Phe Ala Ile Arg Asn Ser Pro Ser 
115 120 125 

Ala Ile Pro Asn Ala Pro Asn Phe Asn Thr Leu Gly Gly Gly Val Asp 
130 135 140 

Tyr Met Phe Lys Gln Lys Val Gly Ala Ser Leu Ser Ala Ala His Ser 
145 150 155 160 

Asp Val Ile Asn Arg Asn Asp Tyr Ser Ala Gly Gly Lys Leu Asn Leu 
165 170 175 

Phe Arg Ser Pro Ser Ser Ser Leu Asp Phe Asn Ala Gly Phe Lys Lys 
180 185 190 

Phe Asp Thr Pro Phe Tyr Arg Ser Ser Trp Glu Pro Asn Val Gly Phe 
195 200 205 

Ser Phe Ser Lys Phe Phe 
210 



<210> 21 
<211> 326 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (177) 

<400> 21 

atg agt gaa gat ccg aga tgt tct cag ccg att gca tct ggt gtg tgc 48 
Met Ser Glu Asp Pro Arg Cys Ser Gln Pro Ile Ala Ser Gly Val Cys 
1 5 10 15 

ttt gga aat att gaa aaa ttc gga tac gac atc gac gag cac aaa tgt 96 
Phe Gly Asn Ile Glu Lys Phe Gly Tyr Asp Ile Asp Glu His Lys Cys 
20 25 30 

gta cag ttc gtg tac gga gga tgc ttt ggc aat gat aac caa ttc gac 144 
Val Gln Phe Val Tyr Gly Gly Cys Phe Gly Asn Asp Asn Gln Phe Asp 
35 40 45 

tcg ctt gaa gaa tgt caa gca gtt tgt cct taa ccattccaat gtttataaat 197 
Ser Leu Glu Glu Cys Gln Ala Val Cys Pro * 
50 55 

gacgtgtata taatacacac aataatcaat cgatttttaa tcgattcaga agccgttatc 257 
tattactaaa ttgctgtgca attttataaa tatttaattt agtgttatta atattcactt 317 
tcaaaaata 326 

<210> 22 
<211> 58 
<212> PRT 

<213> Manduca sexta 
<400> 22 

Met Ser Glu Asp Pro Arg Cys Ser Gln Pro Ile Ala Ser Gly Val Cys 
1 5 10 15 

Phe Gly Asn Ile Glu Lys Phe Gly Tyr Asp Ile Asp Glu His Lys Cys 
20 25 30 

Val Gln Phe Val Tyr Gly Gly Cys Phe Gly Asn Asp Asn Gln Phe Asp 
35 40 45 

Ser Leu Glu Glu Cys Gln Ala Val Cys Pro 
50 55 



<210> 23 
<211> 58 
<212> PRT 

<213> Manduca sexta 
<400> 23 

Met Ser Glu Asp Pro Arg Cys Ser Gln Pro Ile Ala Ser Gly Val Cys 
1 5 10 15 

Phe Gly Asn Ile Glu Lys Phe Gly Tyr Asp Ile Asp Glu His Lys Cys 
20 25 30 

Val Gln Phe Val Tyr Gly Gly Cys Phe Gly Asn Asp Asn Gln Phe Asp 
35 40 45 

Ser Leu Glu Glu Cys Gln Ala Val Cys Pro 
50 55 



<210> 24 
<211> 365 
<212> DNA 

<213> Heliothis virescens 

<220> 

<221> CDS 

<222> (1) . . . (192) 

<400> 24 

atg aat ttc tcg cgg ata ttt ttc ttc gtg ttc gcg tgt ttg gta gca 48 
Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Cys Leu Val Ala 
1 5 10 15 

gtg tgc agc gtg tcg gcg gcg cct gag ccg agg tgg aag gtc ttc aag 96 
Val Cys Ser Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys 
20 25 30 

aaa att gag aag atg ggt cgc aac ata agg gac ggt gtc atc aaa gct 144 
Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Val Ile Lys Ala 
35 40 45 

gcg cca gct atc gaa gtc ctg ggc cag gct aaa gct ctt gga aaa tag 192 
Ala Pro Ala Ile Glu Val Leu Gly Gln Ala Lys Ala Leu Gly Lys * 
50 55 60 

atcttaacta ttaaggaata acgttcaaag tattataagt gttcattacc tcgaatatca 252 
aagaatatct tatgtatttt ttttttttgt aaatattttt gcgtttattt tatgtaatac 312 
tcagagtgca tgcaattaaa ttgttttaaa gcgttaaaaa aaaaaaaaaa aaa 365 

<210> 25 
<211> 63 
<212> PRT 

<213> Heliothis virescens 
<400> 25 

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Cys Leu Val Ala 
1 5 10 15 

Val Cys Ser Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys 
20 25 30 

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Val Ile Lys Ala 
35 40 45 

Ala Pro Ala Ile Glu Val Leu Gly Gln Ala Lys Ala Leu Gly Lys 
50 55 60 



<210> 26 
<211> 63 
<212> PRT 

<213> Heliothis virescens 
<400> 26 

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Cys Leu Val Ala 
1 5 10 15 

Val Cys Ser Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys 
20 25 30 

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Val Ile Lys Ala 
35 40 45 

Ala Pro Ala Ile Glu Val Leu Gly Gln Ala Lys Ala Leu Gly Lys 
50 55 60 

<210> 27 
<211> 600 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (36) . . . (464) 

<400> 27 

actagtggat cccccgggct gcaggacggg ctaca atg tct aag ttt ata tcc 53 
Met Ser Lys Phe Ile Ser 
1 5 

ata ctt tgt gtt gtc gcc tta ctg cta ata gca gaa act tat tgt tta 101 
Ile Leu Cys Val Val Ala Leu Leu Leu Ile Ala Glu Thr Tyr Cys Leu 
10 15 20 

aca agt ggt gtt cgc atc ata caa ccc act tat agg cct cca ccc aaa 149 
Thr Ser Gly Val Arg Ile Ile Gln Pro Thr Tyr Arg Pro Pro Pro Arg 
25 30 35 

aga cct gtt att tac aga gct gca cgc gac gct gga gat gaa ccc ttg 197 
Arg Pro Val Ile Tyr Arg Ala Ala Arg Asp Ala Gly Asp Glu Pro Leu 
40 45 50 

tgg ctg tac caa gga gac gac cac cct cga gcc cct tca agc ggc gac 245 
Trp Leu Tyr Gln Gly Asp Asp His Pro Arg Ala Pro Ser Ser Gly Asp 
55 60 65 70 

cat cct gta ctg ccc tcg atc ata gac gat gtg aag ctg gac ccc aac 293 
His Pro Val Leu Pro Ser Ile Ile Asp Asp Val Lys Leu Asp Pro Asn 
75 80 85 

agg cgg tat gcg cgt agt gta agc gag cct tcg tca cag gag cat cat 341 
Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro Ser Ser Gln Glu His His 
90 95 100 

gac cgc ttt gcg agg agc ttc gac tcc cgc agc agc aag cat cac ggc 389 
Asp Arg Phe Ala Arg Ser Phe Asp Ser Arg Ser Ser Lys His His Gly 
105 110 115 

ggc agt cac tcc acg tcc ggc ggc agc cgc gac act gga gct act cac 437 
Gly Ser His Ser Thr Ser Gly Gly Ser Arg Asp Thr Gly Ala Thr His 
120 125 130 

tcg gga tac aat cgt cgt aac tca taa tttctcttca gtttctaaat 484 
Ser Gly Tyr Asn Arg Arg Asn Ser * 
135 140 

atttttgttt ctgctactaa ttttttctca tcaatattct tgtttgcttt caaatctttc 544 
attttatgat aataatatgt atactgatca ttatattgaa ataaatgatt aaattg 600 

<210> 28 
<211> 142 
<212> PRT 

<213> Manduca sexta 
<400> 28 

Met Ser Lys Phe Ile Ser Ile Leu Cys Val Val Ala Leu Leu Leu Ile 
1 5 10 15 

Ala Glu Thr Tyr Cys Leu Thr Ser Gly Val Arg Ile Ile Gln Pro Thr 
20 25 30 

Tyr Arg Pro Pro Pro Arg Arg Pro Val Ile Tyr Arg Ala Ala Arg Asp 
35 40 45 

Ala Gly Asp Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro Arg 
50 55 60 

Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp 
65 70 75 80 

Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro 
85 90 95 

Ser Ser Gln Glu His His Asp Arg Phe Ala Arg Ser Phe Asp Ser Arg 
100 105 110 

Ser Ser Lys His His Gly Gly Ser His Ser Thr Ser Gly Gly Ser Arg 
115 120 125 

Asp Thr Gly Ala Thr His Ser Gly Tyr Asn Arg Arg Asn Ser 
130 135 140 



<210> 29 
<211> 142 
<212> PRT 

<213> Manduca sexta 
<400> 29 

Met Ser Lys Phe Ile Ser Ile Leu Cys Val Val Ala Leu Leu Leu Ile 
1 5 10 15 

Ala Glu Thr Tyr Cys Leu Thr Ser Gly Val Arg Ile Ile Gln Pro Thr 
20 25 30 

Tyr Arg Pro Pro Pro Arg Arg Pro Val Ile Tyr Arg Ala Ala Arg Asp 
35 40 45 

Ala Gly Asp Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro Arg 
50 55 60 

Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp 
65 70 75 80 

Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro 
85 90 95 

Ser Ser Gln Glu His His Asp Arg Phe Ala Arg Ser Phe Asp Ser Arg 
100 105 110 

Ser Ser Lys His His Gly Gly Ser His Ser Thr Ser Gly Gly Ser Arg 
115 120 125 

Asp Thr Gly Ala Thr His Ser Gly Tyr Asn Arg Arg Asn Ser 
130 135 140 

<210> 30 
<211> 360 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 

<221> CDS 

<222> (1) . . . (201) 



<400> 30 



S S ££ S 5 £ 2 £2 K S3 SS S ill J2 » IS 



5 10 



15 



Phe S A?a Val Ser ST 2?" ^ CCt ^ aat CCt ttfc 

Ala Ala Val Ser Ala Ala Pro Asn Pro Arg Trp Asn Pro Phe Lys 



48 



96 



25 



30 



144 



192 



aaa ctg gag cgt gtg ggc cag aac ate cgt gac gqq ate ate „™ 
Lys Leu Glu Arg Val Gly Gln Asn Ile Arg Asp Gly Ile Ile Lys Ala 

40 45 

IS Pro Ala f? a ?*? 9t9 " C CM 9Ct * CC acc ata tac aag gge 

Ala Pro Ala Val Ala Val Val Gly Gln Ala Ala Thr Ile Tyr Lys Gly 

55 60 

l?y Lys ataaCtacat <=atcatcatc gtcatcatca tcatcatctg 241 

65 

SSSS batata SSSS "" atat9a "tcatgtgg aeaageatet 301 
actt tttgtatata attttgtacc aaaaatggta tggtaaagtt atgaaacgt 360 

<210> 31 
<211> 66 
<212> PRT 

<213> Ostrinia nubilalis 
<400> 31 

Met Asn Phe Ser Lys Ile Leu Phe Ala Val Phe Ala Ile Phe Met Ala 
Phe Ala Ala Val Ser Ala Ala Pro Asn Pro Arg Trp Asn Pro Phe Lys 
Lys Leu Glu Arg Val Gly Gln Asn Ile Arg Asp Gly Ile Ile Lys Ala 
Ala Pro Ala Val Ala Val Val Gly Gln Ala Ala Thr Ile Tyr Lys Gly 
Gly Lys 
65 



<210> 32 
<211> 66 
<212> PRT 

<213> Ostrinia nubilalis 
<400> 32 

Met Asn Phe Ser Lys Ile Leu Phe Ala Val Phe Ala Ile Phe Met Ala 
1 5 10 15 

Phe Ala Ala Val Ser Ala Ala Pro Asn Pro Arg Trp Asn Pro Phe Lys 
20 25 30 

Lys Leu Glu Arg Val Gly Gln Asn Ile Arg Asp Gly Ile Ile Lys Ala 
35 40 45 

Ala Pro Ala Val Ala Val Val Gly Gln Ala Ala Thr Ile Tyr Lys Gly 
50 55 60 

Gly Lys 
65 



<210> 33 
<211> 407 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 

<221> CDS 

<222> (3) . . . (281) 

<221> misc_feature 
<222> 378 

<223> n = A,T,C or G 
<400> 33 

gg cac cag gta gtg ttg tgt tcc ctg gcc gcc gtg ctt ctg gcg ttc 47 
His Gln Val Val Leu Cys Ser Leu Ala Ala Val Leu Leu Ala Phe 
1 5 10 15 

gtc gct gaa tcg tca gcg cag cgt ttc atc cag ccg acc tac agg ccg 95 
Val Ala Glu Ser Ser Ala Gln Arg Phe Ile Gln Pro Thr Tyr Arg Pro 
20 25 30 

ccg cct caa cga cca ccg aag ata tac aga ctg cga aga gat gca ggc 143 
Pro Pro Gln Arg Pro Pro Lys Ile Tyr Arg Leu Arg Arg Asp Ala Gly 
35 40 45 

gaa ccg cta tgg ctg tac caa ggt gat gat gtt cag cga gcc cca gcc 191 
Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp Val Gln Arg Ala Pro Ala 
50 55 60 

acc ggc gac cat cct tac ctt ccg cca aac atc gac gac atc cat cta 239 
Thr Gly Asp His Pro Tyr Leu Pro Pro Asn Ile Asp Asp Ile His Leu 
65 70 75 

gac ccc aac acc aag ata cgc tcg cag cgt cga ctc tcc tag 281 
Asp Pro Asn Thr Lys Ile Arg Ser Gln Arg Arg Leu Ser * 
80 85 90 

cgctaagcgt ggaggaggca gccacagcac ctccagtggg aagcaaggga cactggcgca 341 
acgcaccccg gggtacaatc ggccgcaacg cccgaangca taagattcga ccccatctcc 401 
ccggct 407 

<210> 34 
<211> 90 
<212> PRT 

<213> Ostrinia nubilalis 

<400> 34 

Val Val Leu Cys Ser Leu Ala Ala Val Leu Leu Ala Phe Val Ala Glu 
1 5 10 15 

Ser Ser Ala Gln Arg Phe Ile Gln Pro Thr Tyr Arg Pro Pro Pro Gln 
20 25 30 

Arg Pro Pro Lys Ile Tyr Arg Leu Arg Arg Asp Ala Gly Glu Pro Leu 
35 40 45 

Trp Leu Tyr Gln Gly Asp Asp Val Gln Arg Ala Pro Ala Thr Gly Asp 
50 55 60 

His Pro Tyr Leu Pro Pro Asn Ile Asp Asp Ile His Leu Asp Pro Asn 
65 70 75 80 

Thr Lys Ile Arg Ser Gln Arg Arg Leu Ser 
85 90 

<210> 35 
<211> 92 
<212> PRT 

<213> Ostrinia nubilalis 



<400> 35 
His Gln Val Val Leu Cys Ser Leu Ala Ala Val Leu Leu Ala Phe Val 
1 5 10 15 

Ala Glu Ser Ser Ala Gln Arg Phe Ile Gln Pro Thr Tyr Arg Pro Pro 
20 25 30 

Pro Gln Arg Pro Pro Lys Ile Tyr Arg Leu Arg Arg Asp Ala Gly Glu 
35 40 45 

Pro Leu Trp Leu Tyr Gln Gly Asp Asp Val Gln Arg Ala Pro Ala Thr 
50 55 60 

Gly Asp His Pro Tyr Leu Pro Pro Asn Ile Asp Asp Ile His Leu Asp 
65 70 75 80 

Pro Asn Thr Lys Ile Arg Ser Gln Arg Arg Leu Ser 
85 90 

<210> 36 
<211> 362 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 

<221> CDS 

<222> (1) . . . (252) 



<400> 36 

m^? Ph. o 9t ttfc att a " ttC atg tt9 ^ 9cc att gcg age 48 

Met Phe Lys Leu Ser Phe Ile Ile Phe Met Leu Val Ala Ile Ala Ser 
1 5 10 15 

gtt tta agc agt gaa gcc cca gcc cca gac tgc acc tcg cct ctt gag 96 
Val Leu Ser Ser Glu Ala Pro Ala Pro Asp Cys Thr Ser Pro Leu Glu 
20 25 30 

™° H?* S° a t9C a9a "° a " aaa 9tt gGt ttc 99 c tac 9at act gac 144 
Thr Gly Pro Cys Arg Gly Arg Lys Val Ala Phe Gly Tyr Asp Thr Asp 
35 40 45 

ttg gaa gga tgc aaa cag ttc atc tac gga gga tgt gac ggc aac ggc 192 
Leu Glu Gly Cys Lys Gln Phe Ile Tyr Gly Gly Cys Asp Gly Asn Gly 
50 55 60 

aac cgt tac aac act cta gag gag tgt cag gct gct tgc gag agt gac 240 
Asn Arg Tyr Asn Thr Leu Glu Glu Cys Gln Ala Ala Cys Glu Ser Asp 
65 70 75 80 

tgc aac aaa taa taacgaaatg caagcaatca attgggtatt tgacagcaca 292 
Cys Asn Lys * 

gtcaattgac atactttttt taaactgtca aaacgcaaca ttccctattt ttcacatttt 352 
gcaaagtaga 362 

<210> 37 
<211> 83 
<212> PRT 

<213> Ostrinia nubilalis 



<400> 37 



Met Phe Lys Leu Ser Phe Ile Ile Phe Met Leu Val Ala Ile Ala Ser 
1 5 10 15 

Val Leu Ser Ser Glu Ala Pro Ala Pro Asp Cys Thr Ser Pro Leu Glu 
20 25 30 

Thr Gly Pro Cys Arg Gly Arg Lys Val Ala Phe Gly Tyr Asp Thr Asp 
35 40 45 

Leu Glu Gly Cys Lys Gln Phe Ile Tyr Gly Gly Cys Asp Gly Asn Gly 
50 55 60 

Asn Arg Tyr Asn Thr Leu Glu Glu Cys Gln Ala Ala Cys Glu Ser Asp 
65 70 75 80 

Cys Asn Lys 

<210> 38 

<211> 83 

<212> PRT 

<213> Ostrinia nubilalis 



<400> 38 



Met Phe Lys Leu Ser Phe Ile Ile Phe Met Leu Val Ala Ile Ala Ser 
1 5 10 15 

Val Leu Ser Ser Glu Ala Pro Ala Pro Asp Cys Thr Ser Pro Leu Glu 
20 25 30 

Thr Gly Pro Cys Arg Gly Arg Lys Val Ala Phe Gly Tyr Asp Thr Asp 
35 40 45 

Leu Glu Gly Cys Lys Gln Phe Ile Tyr Gly Gly Cys Asp Gly Asn Gly 
50 55 60 

Asn Arg Tyr Asn Thr Leu Glu Glu Cys Gln Ala Ala Cys Glu Ser Asp 
65 70 75 80 

Cys Asn Lys 

<210> 39 
<211> 242 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 

<221> CDS 

<222> (1) . . . (201) 

<400> 39 

atg aat ttc tcc aaa att ctt ttc gcg atc ttc gct tgt ttc atg gcg 48 
Met Asn Phe Ser Lys Ile Leu Phe Ala Ile Phe Ala Cys Phe Met Ala 
1 5 10 15 

ttc gcc gcc gtg tca gct gct cct gaa cca aga tgg aac ccg ttt aag 96 
Phe Ala Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Asn Pro Phe Lys 
20 25 30 

aaa ctt gag cga gtg ggc cag aac atc cga gac ggc atc gtg aag gca 144 
Lys Leu Glu Arg Val Gly Gln Asn Ile Arg Asp Gly Ile Val Lys Ala 
35 40 45 

caa cca gct atc caa gta gtg gga gaa gcg gct aca ata tac aga ggt 192 
Gln Pro Ala Ile Gln Val Val Gly Glu Ala Ala Thr Ile Tyr Arg Gly 
50 55 60 

ggt aaa taa tttaccacat agcaaacatc gtctagttta aaaatcgaat 241 
Gly Lys * 
65 



<210> 40 
<211> 66 
<212> PRT 

<213> Ostrinia nubilalis 
<400> 40 

Met Asn Phe Ser Lys Ile Leu Phe Ala Ile Phe Ala Cys Phe Met Ala 
1 5 10 15 

Phe Ala Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Asn Pro Phe Lys 
20 25 30 

Lys Leu Glu Arg Val Gly Gln Asn Ile Arg Asp Gly Ile Val Lys Ala 
35 40 45 

Gln Pro Ala Ile Gln Val Val Gly Glu Ala Ala Thr Ile Tyr Arg Gly 
50 55 60 

Gly Lys 
65 



242 



<210> 41 
<211> 63 
<212> PRT 

<213> Ostrinia nubilalis 
<400> 41 

Met Asn Phe Ser Lys Ile Leu Phe Ala Ile Phe Ala Cys Phe Met Ala 
1 5 10 15 

Phe Ala Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Asn Pro Phe Lys 
20 25 30 

Lys Leu Glu Arg Val Gly Gln Asn Ile Arg Asp Gly Ile Val Lys Ala 
35 40 45 

Gln Pro Ala Ile Gln Val Val Gly Glu Ala Ala Thr Ile Tyr Arg 
50 55 60 



<210> 42 
<211> 471 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 

<221> CDS 

<222> (1) . . . (198) 

<400> 42 

atg aaa ttt tca aag gtt ttc ttc gtt ttc ttc gca ttc gtg gct gcg 48 
Met Lys Phe Ser Lys Val Phe Phe Val Phe Phe Ala Phe Val Ala Ala 
1 5 10 15 

ttt gcg acg gtc acc gct tcg cca ttc aac tta ggg aag gaa ctg gaa 96 
Phe Ala Thr Val Thr Ala Ser Pro Phe Asn Leu Gly Lys Glu Leu Glu 
20 25 30 

gga atc ggc cag aga gtg agg gac agc atc atc agt gcc cga ccg gct 144 
Gly Ile Gly Gln Arg Val Arg Asp Ser Ile Ile Ser Ala Arg Pro Ala 
35 40 45 

gtt gac acc atc ttg gaa gcc cag aag ata ttc aag gga ggc gac aaa 192 
Val Asp Thr Ile Leu Glu Ala Gln Lys Ile Phe Lys Gly Gly Asp Lys 
50 55 60 

gac tga acgaaatgac gtcataattt aaatacaaat atttttttaa gttagtttta 248 
Asp * 
65 

caacataaaa cgttaatacc tacgtacgtt tgaggaaaaa ctcattagat tattattcat 308 
gtaaattatg tagattagca aaagagaatt tcaaattacc tttgtttgga actcggattc 368 
tgtgatataa tatatgttta ttttaaagta tttagttgta tctattttta ttttcacagt 428 
cagcacattt cctaattaat tttgaacttt gaattagagt aag 471 

<210> 43 
<211> 65 
<212> PRT 

<213> Ostrinia nubilalis 



<400> 43 

Met Lys Phe Ser Lys Val Phe Phe Val Phe Phe Ala Phe Val Ala Ala 
1 5 10 15 

Phe Ala Thr Val Thr Ala Ser Pro Phe Asn Leu Gly Lys Glu Leu Glu 
20 25 30 

Gly Ile Gly Gln Arg Val Arg Asp Ser Ile Ile Ser Ala Arg Pro Ala 
35 40 45 

Val Asp Thr Ile Leu Glu Ala Gln Lys Ile Phe Lys Gly Gly Asp Lys 
50 55 60 

Asp 
65 



<210> 44 
<211> 60 
<212> PRT 

<213> Ostrinia nubilalis 
<400> 44 

Met Lys Phe Ser Lys Val Phe Phe Val Phe Phe Ala Phe Val Ala Ala 
1 5 10 15 

Phe Ala Thr Val Thr Ala Ser Pro Phe Asn Leu Gly Lys Glu Leu Glu 
20 25 30 

Gly Ile Gly Gln Arg Val Arg Asp Ser Ile Ile Ser Ala Arg Pro Ala 
35 40 45 

Val Asp Thr Ile Leu Glu Ala Gln Lys Ile Phe Lys 
50 55 60 



<210> 45 
<211> 464 
<212> DNA 

<213> Ostrinia nubilalis 

<220> 

<221> CDS 

<222> (1) . . . (464) 

<400> 45 

atg caa cga gta gtg ttg tgt tcc ctg gcc gcc gtg ctc ctg gcg ttc 48 
Met Gln Arg Val Val Leu Cys Ser Leu Ala Ala Val Leu Leu Ala Phe 
1 5 10 15 

gtc gct gaa tcg tca gcg cag cgt ttc atc cag ccg acc tac agg ccg 96 
Val Ala Glu Ser Ser Ala Gln Arg Phe Ile Gln Pro Thr Tyr Arg Pro 
20 25 30 

ccg cct caa cga cca ccg aag ata tac aga ctg cga aga gat gca ggc 144 
Pro Pro Gln Arg Pro Pro Lys Ile Tyr Arg Leu Arg Arg Asp Ala Gly 
35 40 45 

gaa ccg cta tgg ctg tac caa ggt gat gat gtt cag cga gcg cca gcc 192 
Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp Val Gln Arg Ala Pro Ala 
50 55 60 

acc ggt gac cac cct tac ctg ccg cca aac atc gac gac atc cat cta 240 
Thr Gly Asp His Pro Tyr Leu Pro Pro Asn Ile Asp Asp Ile His Leu 
65 70 75 80 

gac ccc aac acc aga tac gct cgc agc gtc gac tct cct agc gct aag 288 
Asp Pro Asn Thr Arg Tyr Ala Arg Ser Val Asp Ser Pro Ser Ala Lys 
85 90 95 

cgt gga gga ggc agc cac agc acc tcc agt gga agc agg gat act ggc 336 
Arg Gly Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg Asp Thr Gly 
100 105 110 

gcc acg cac ccc ggg tac aat cgc cgc aac gcc cga agc ata aga ttc 384 
Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ala Arg Ser Ile Arg Phe 
115 120 125 

gac cct atc tct ccg ctg ccg tcc ccg act ttc cct aaa cca ttc gac 432 
Asp Pro Ile Ser Pro Leu Pro Ser Pro Thr Phe Pro Lys Pro Phe Asp 
130 135 140 

ccg ttc aac ccc cgg cct gtt tcg ccc acc ag 464 
Pro Phe Asn Pro Arg Pro Val Ser Pro Thr 
145 150 



<210> 46 
<211> 154 
<212> PRT 

<213> Ostrinia nubilalis 
<400> 46 

Met Gln Arg Val Val Leu Cys Ser Leu Ala Ala Val Leu Leu Ala Phe 
1 5 10 15 

Val Ala Glu Ser Ser Ala Gln Arg Phe Ile Gln Pro Thr Tyr Arg Pro 
20 25 30 

Pro Pro Gln Arg Pro Pro Lys Ile Tyr Arg Leu Arg Arg Asp Ala Gly 
35 40 45 

Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp Val Gln Arg Ala Pro Ala 
50 55 60 

Thr Gly Asp His Pro Tyr Leu Pro Pro Asn Ile Asp Asp Ile His Leu 
65 70 75 80 

Asp Pro Asn Thr Arg Tyr Ala Arg Ser Val Asp Ser Pro Ser Ala Lys 
85 90 95 

Arg Gly Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg Asp Thr Gly 
100 105 110 

Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ala Arg Ser Ile Arg Phe 
115 120 125 

Asp Pro Ile Ser Pro Leu Pro Ser Pro Thr Phe Pro Lys Pro Phe Asp 
130 135 140 

Pro Phe Asn Pro Arg Pro Val Ser Pro Thr 
145 150 

<210> 47 
<211> 181 
<212> PRT 

<213> Ostrinia nubilalis 
<220> 

<221> VARIANT 
<222> 155 

<223> Xaa = Any Amino Acid 
<400> 47 

Met Gln Arg Val Val Leu Cys Ser Leu Ala Ala Val Leu Leu Ala Phe 
1 5 10 15 

Val Ala Glu Ser Ser Ala Gln Arg Phe Ile Gln Pro Thr Tyr Arg Pro 
20 25 30 

Pro Pro Gln Arg Pro Pro Lys Ile Tyr Arg Leu Arg Arg Asp Ala Gly 
35 40 45 

Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp Val Gln Arg Ala Pro Ala 
50 55 60 

Thr Gly Asp His Pro Tyr Leu Pro Pro Asn Ile Asp Asp Ile His Leu 
65 70 75 80 

Asp Pro Asn Thr Arg Tyr Ala Arg Ser Val Asp Ser Pro Ser Ala Lys 
85 90 95 

Arg Gly Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg Asp Thr Gly 
100 105 110 

Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ala Arg Ser Ile Arg Phe 
115 120 125 

Asp Pro Ile Ser Pro Leu Pro Ser Pro Thr Phe Pro Lys Pro Phe Asp 
130 135 140 

Pro Phe Asn Pro Arg Pro Val Ser Pro Thr Xaa Pro Phe Pro Leu Tyr 
145 150 155 160 

Ala Arg Ser Arg Arg Asp Ile Gln Phe Pro Gln Lys Pro Lys His His 
165 170 175 

Asp Ile Val Leu Thr 
180 

<210> 48 
<211> 538 
<212> DNA 

<213> Heliothis virescens 

<220> 

<221> CDS 

<222> (1) . . . (432) 

<400> 48 

atg gca aaa tcc att ttc gcg ctt gga gtt atc gca gtt ctg ttg ata 48 
Met Ala Lys Ser Ile Phe Ala Leu Gly Val Ile Ala Val Leu Leu Ile 
1 5 10 15 

aca gaa tcc aac tgt tgg aga agt gat ctc cct atc ata ctc ccg act 96 
Thr Glu Ser Asn Cys Trp Arg Ser Asp Leu Pro Ile Ile Leu Pro Thr 
20 25 30 

tat aaa cct cct cgt acc ccg agc acc gtt att atc agg aca gta cgc 144 
Tyr Lys Pro Pro Arg Thr Pro Ser Thr Val Ile Ile Arg Thr Val Arg 
35 40 45 

gaa gcc gga gat aaa ccg tta tgg ctc tac caa gga gac gat cac ccg 192 
Glu Ala Gly Asp Lys Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro 
50 55 60 

cga gcc cct tca agc ggc gat cat cct gta ctg ccc ccg atc ata gac 240 
Arg Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Pro Ile Ile Asp 
65 70 75 80 

gat gtg aaa ctg gac ccc aac aga cgg tac gcg cgt agt gtg aac gag 288 
Asp Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Asn Glu 
85 90 95 

ccc tcg tct cag gag cat cac gaa cgc ttt gtg agg agc ttc gac tcc 336 
Pro Ser Ser Gln Glu His His Glu Arg Phe Val Arg Ser Phe Asp Ser 
100 105 110 

cgc agc agc agg cat cac ggc ggc agt cac tcc acg tcc agc ggc agc 384 
Arg Ser Ser Arg His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser 
115 120 125 

cgc gac act gga gct act cat ccg gga tac aat cgt cgt aac tca taa 432 
Arg Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser * 
130 135 140 

tctgtggttt aatgtattag atatttgtgt ttaacattaa aacatttttg aaattgtcta 492 
ctcgaataaa tacatttacc tattttaaaa aaaaaaaaaa aaaaaa 538 

<210> 49 
<211> 143 
<212> PRT 

<213> Heliothis virescens 
<400> 49 

Met Ala Lys Ser Ile Phe Ala Leu Gly Val Ile Ala Val Leu Leu Ile 
1 5 10 15 

Thr Glu Ser Asn Cys Trp Arg Ser Asp Leu Pro Ile Ile Leu Pro Thr 
20 25 30 

Tyr Lys Pro Pro Arg Thr Pro Ser Thr Val Ile Ile Arg Thr Val Arg 
35 40 45 

Glu Ala Gly Asp Lys Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro 
50 55 60 

Arg Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Pro Ile Ile Asp 
65 70 75 80 

Asp Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Asn Glu 
85 90 95 

Pro Ser Ser Gln Glu His His Glu Arg Phe Val Arg Ser Phe Asp Ser 
100 105 110 

Arg Ser Ser Arg His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser 
115 120 125 

Arg Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser 
130 135 140 



<210> 50 
<211> 143 
<212> PRT 

<213> Heliothis virescens 
<400> 50 

Met Ala Lys Ser Ile Phe Ala Leu Gly Val Ile Ala Val Leu Leu Ile
1 5 10 15

Thr Glu Ser Asn Cys Trp Arg Ser Asp Leu Pro Ile Ile Leu Pro Thr
20 25 30

Tyr Lys Pro Pro Arg Thr Pro Ser Thr Val Ile Ile Arg Thr Val Arg
35 40 45

Glu Ala Gly Asp Lys Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro
50 55 60

Arg Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Pro Ile Ile Asp
65 70 75 80

Asp Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Asn Glu
85 90 95

Pro Ser Ser Gln Glu His His Glu Arg Phe Val Arg Ser Phe Asp Ser
100 105 110

Arg Ser Ser Arg His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser
115 120 125

Arg Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser
130 135 140



<210> 51 

<211> 481 

<212> DNA 

<213> Heliothis virescens 

<220> 

<221> CDS 

<222> (1) . . . (429) 

<400> 51 

atg aag tca gta ctt gta ctt tgc gtt gtt gcg gtg ttg cat acg gca 48
Met Lys Ser Val Leu Val Leu Cys Val Val Ala Val Leu His Thr Ala
1 5 10 15

gca tcc tca ggc tgg aat aaa aat aat ggc ggc atc ata ctt ccg acc 96
Ala Ser Ser Gly Trp Asn Lys Asn Asn Gly Gly Ile Ile Leu Pro Thr
20 25 30

ttt aga cct cca cct ata tgg cca gga att acc agg aca gta cgt gaa 144
Phe Arg Pro Pro Pro Ile Trp Pro Gly Ile Thr Arg Thr Val Arg Glu
35 40 45

gct gga gat caa cct tta tgg ctg tac caa gga gac aat cac ccg cga 192
Ala Gly Asp Gln Pro Leu Trp Leu Tyr Gln Gly Asp Asn His Pro Arg
50 55 60

gcc cct tca agc ggc gat cat cct gta ctg ccc tcg atc ata gac gat 240
Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp
65 70 75 80

gtg aag ttg gac ccc aac agg cgg tac gtg cgt agt gtg aac gag ccg 288
Val Lys Leu Asp Pro Asn Arg Arg Tyr Val Arg Ser Val Asn Glu Pro
85 90 95

tcg tca cag gag cat cac gaa cgc ttt gtg agg agc ttc gac tcc cgc 336
Ser Ser Gln Glu His His Glu Arg Phe Val Arg Ser Phe Asp Ser Arg
100 105 110

agc agc agg cat cac ggc ggc agc cac tct acg tcc agc ggc agc cgc 384
Ser Ser Arg His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg
115 120 125

gac act gga gct act cat ccg gga tac aat cgt cgt aac tca taa 429
Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser *
130 135 140

tctgtggttt aatccattag aaatttgtgt ttgtattttg ataaaaacaa tg 481 

<210> 52 
<211> 142 
<212> PRT 

<213> Heliothis virescens 
<400> 52 

Met Lys Ser Val Leu Val Leu Cys Val Val Ala Val Leu His Thr Ala
1 5 10 15

Ala Ser Ser Gly Trp Asn Lys Asn Asn Gly Gly Ile Ile Leu Pro Thr
20 25 30

Phe Arg Pro Pro Pro Ile Trp Pro Gly Ile Thr Arg Thr Val Arg Glu
35 40 45

Ala Gly Asp Gln Pro Leu Trp Leu Tyr Gln Gly Asp Asn His Pro Arg
50 55 60

Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp
65 70 75 80

Val Lys Leu Asp Pro Asn Arg Arg Tyr Val Arg Ser Val Asn Glu Pro
85 90 95

Ser Ser Gln Glu His His Glu Arg Phe Val Arg Ser Phe Asp Ser Arg
100 105 110

Ser Ser Arg His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg
115 120 125

Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser
130 135 140



<210> 53 
<211> 142 
<212> PRT 

<213> Heliothis virescens 



<400> 53

Met Lys Ser Val Leu Val Leu Cys Val Val Ala Val Leu His Thr Ala
1 5 10 15

Ala Ser Ser Gly Trp Asn Lys Asn Asn Gly Gly Ile Ile Leu Pro Thr
20 25 30

Phe Arg Pro Pro Pro Ile Trp Pro Gly Ile Thr Arg Thr Val Arg Glu
35 40 45

Ala Gly Asp Gln Pro Leu Trp Leu Tyr Gln Gly Asp Asn His Pro Arg
50 55 60

Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp
65 70 75 80

Val Lys Leu Asp Pro Asn Arg Arg Tyr Val Arg Ser Val Asn Glu Pro
85 90 95

Ser Ser Gln Glu His His Glu Arg Phe Val Arg Ser Phe Asp Ser Arg
100 105 110

Ser Ser Arg His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg
115 120 125

Asp Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser
130 135 140


<210> 54 
<211> 418 
<212> DNA 

<213> Heliothis virescens 

<220> 

<221> CDS 

<222> (1) . . . (192) 

<400> 54 

atg aat tct aaa ata gtg att ttt ttg tgc att tgt ttt gtt ctt gtg 48
Met Asn Ser Lys Ile Val Ile Phe Leu Cys Ile Cys Phe Val Leu Val
1 5 10 15

tca acg gca acg gca tgg gat ttg ttt aaa gaa att gag gga gca ggt 96
Ser Thr Ala Thr Ala Trp Asp Leu Phe Lys Glu Ile Glu Gly Ala Gly
20 25 30

cag agg gtg cgt gat gcc atc atc agc gct ggc cct gcg gtc gac gtg 144
Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Asp Val
35 40 45

ctc acc aaa act aaa gga tta ttc gac agc tct gaa gaa aaa gat tag 192
Leu Thr Lys Thr Lys Gly Leu Phe Asp Ser Ser Glu Glu Lys Asp *
50 55 60

tttataataa aatgtaaact cagcttagat taggtacaga cgctagccgg tcaacgtacc 252

aacgtctgtc aaattttacc aatcgaactt taaccttcca ctgttgtgat aaggttgaaa 312 

atctattgag gaaatttgtc agattgtgat ttgccaggtc gacgtgttgg tatctgtaaa 372 

tttatacttt cattaagtaa tattgtagct gtaacactga aagaac 418 

<210> 55 
<211> 57 
<212> PRT 

<213> Heliothis virescens 
<400> 55 

Met Asn Ser Lys Ile Val Ile Phe Leu Cys Ile Cys Phe Val Leu Val
1 5 10 15

Ser Thr Ala Thr Ala Trp Asp Leu Phe Lys Glu Ile Glu Gly Ala Gly
20 25 30

Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Asp Val
35 40 45

Leu Thr Lys Thr Lys Gly Leu Phe Asp
50 55



<210> 56 
<211> 63 
<212> PRT 

<213> Heliothis virescens 
<400> 56 

Met Asn Ser Lys Ile Val Ile Phe Leu Cys Ile Cys Phe Val Leu Val
1 5 10 15

Ser Thr Ala Thr Ala Trp Asp Leu Phe Lys Glu Ile Glu Gly Ala Gly
20 25 30

Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Asp Val
35 40 45

Leu Thr Lys Thr Lys Gly Leu Phe Asp Ser Ser Glu Glu Lys Asp
50 55 60



<210> 57 
<211> 275 
<212> DNA 

<213> Heliothis virescens 

<220> 

<221> CDS 

<222> (1) . . . (189) 

<400> 57 

atg aac ttc tca agg ata ttt ttc ttc gtg ttc gcg tgt ttg gta gta 48
Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Cys Leu Val Val
1 5 10 15

ctg tgc agc gtg tcg gcg gcg cct gag ccg agg tgg aag gtc ttc aag 96
Leu Cys Ser Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

aaa att gag aag atg ggt cgc aac atc cga gac ggc atc gta aag gct 144
Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

gga cca gcg ata gca gtt ctc ggc caa gct aaa gca tta gga taa 189
Gly Pro Ala Ile Ala Val Leu Gly Gln Ala Lys Ala Leu Gly *
50 55 60

ataattattg tattattaat attaagagtt taatatctaa gtcgcattta aatactcatt 249
ctgccataaa taaatgtatt ttaagt 275

<210> 58
<211> 62
<212> PRT

<213> Heliothis virescens

<400> 58

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Cys Leu Val Val
1 5 10 15

Leu Cys Ser Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

Gly Pro Ala Ile Ala Val Leu Gly Gln Ala Lys Ala Leu Gly
50 55 60



<210> 59 
<211> 62 
<212> PRT 
<213> Heliothis virescens

<400> 59

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Cys Leu Val Val
1 5 10 15

Leu Cys Ser Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

Gly Pro Ala Ile Ala Val Leu Gly Gln Ala Lys Ala Leu Gly
50 55 60



<210> 60 
<211> 397 
<212> DNA 

<213> Helicoverpa zea 



<220> 



<221> CDS 

<222> (1) . . . (192) 

<221> misc_feature 
<222> 229, 267, 326 
<223> n = A,T,C or G 



<400> 60 

atg aat tcc aaa att gta tta ttc ctg tgt gtt tgt ttg gtg ctt gtg 48
Met Asn Ser Lys Ile Val Leu Phe Leu Cys Val Cys Leu Val Leu Val
1 5 10 15

tcg acg gca aca gca tgg gac ttc ttt aag gaa ctt gaa gga gca gga 96
Ser Thr Ala Thr Ala Trp Asp Phe Phe Lys Glu Leu Glu Gly Ala Gly
20 25 30

caa aga gtc cgc gat gct atc atc agc gct ggc cct gct gtc gac gtt 144
Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Asp Val
35 40 45

ctc acc aaa gct aag ggg cta tac gac agc tcc gaa gaa aaa gat tag 192
Leu Thr Lys Ala Lys Gly Leu Tyr Asp Ser Ser Glu Glu Lys Asp *
50 55 60

gatataagcc aatcaaatca tcatcatcat agtcaanaat caatcaaaat caaaactcat 252
ttattcaaac ttggntgcaa aacaagcact tttcgaacgt caaaaaaaaa tttacataag 312
acagcccccc aatncgccca cccttcacca acttccctaa gttgtttttt gctggggaaa 372
gaaagaagtt ggcgcaacaa aacct 397

<210> 61 
<211> 63 
<212> PRT 

<213> Helicoverpa zea
<400> 61

Met Asn Ser Lys Ile Val Leu Phe Leu Cys Val Cys Leu Val Leu Val
1 5 10 15

Ser Thr Ala Thr Ala Trp Asp Phe Phe Lys Glu Leu Glu Gly Ala Gly
20 25 30

Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Asp Val
35 40 45

Leu Thr Lys Ala Lys Gly Leu Tyr Asp Ser Ser Glu Glu Lys Asp
50 55 60



<210> 62 
<211> 57 
<212> PRT 

<213> Helicoverpa zea
<400> 62

Met Asn Ser Lys Ile Val Leu Phe Leu Cys Val Cys Leu Val Leu Val
1 5 10 15

Ser Thr Ala Thr Ala Trp Asp Phe Phe Lys Glu Leu Glu Gly Ala Gly
20 25 30

Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Asp Val
35 40 45

Leu Thr Lys Ala Lys Gly Leu Tyr Asp
50 55



<210> 63 
<211> 263 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (186) 

<221> misc_feature
<222> 56, 65, 108, 123
<223> n = A,T,C or G

<400> 63

atg aac ttc tct cgc gtt ttg ttc ttc gtg ttt gct tgc gtc agc gca 48
Met Asn Phe Ser Arg Val Leu Phe Phe Val Phe Ala Cys Val Ser Ala
1 5 10 15

ttc gcc gng act tca gnt gcg ccc tgt aat ccc ttt aag gaa ctg gag 96
Phe Ala Xaa Thr Ser Xaa Ala Pro Cys Asn Pro Phe Lys Glu Leu Glu
20 25 30

aga gct ggc can cga gtc cgc gac gcn gtc atc agc gcc gcg cct gca 144
Arg Ala Gly Xaa Arg Val Arg Asp Ala Val Ile Ser Ala Ala Pro Ala
35 40 45

gtc gcg acc gtc gga cag gcg gcc gcc atc gcc agc gga taa 186
Val Ala Thr Val Gly Gln Ala Ala Ala Ile Ala Ser Gly *
50 55 60

taaccaatgg atgcttcact attcattatt atcataaatt atatgtgcca taccttaata 246
tgttccttac atttgta 263

<210> 64 
<211> 61 
<212> PRT 

<213> Manduca sexta 
<220> 

<221> VARIANT 

<222> 19, 22, 36

<223> Xaa = Any Amino Acid

<400> 64

Met Asn Phe Ser Arg Val Leu Phe Phe Val Phe Ala Cys Val Ser Ala
1 5 10 15

Phe Ala Xaa Thr Ser Xaa Ala Pro Cys Asn Pro Phe Lys Glu Leu Glu
20 25 30

Arg Ala Gly Xaa Arg Val Arg Asp Ala Val Ile Ser Ala Ala Pro Ala
35 40 45

Val Ala Thr Val Gly Gln Ala Ala Ala Ile Ala Ser Gly
50 55 60

<210> 65 
<211> 61 
<212> PRT 

<213> Manduca sexta
<220>

<221> VARIANT

<222> 19, 22, 36

<223> Xaa = Any Amino Acid

<400> 65

Met Asn Phe Ser Arg Val Leu Phe Phe Val Phe Ala Cys Val Ser Ala
1 5 10 15

Phe Ala Xaa Thr Ser Xaa Ala Pro Cys Asn Pro Phe Lys Glu Leu Glu
20 25 30

Arg Ala Gly Xaa Arg Val Arg Asp Ala Val Ile Ser Ala Ala Pro Ala
35 40 45

Val Ala Thr Val Gly Gln Ala Ala Ala Ile Ala Ser Gly
50 55 60

<210> 66 
<211> 367 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (186) 

<400> 66 

atg aac ttc tcc agg atc ttc ttc ttc gtc ttc gcc ttg gtt ctt oar 48
Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Leu Val Leu Gly
1 5 10 15

atg tct gct gta tca gca gct ccc aaa tgg aag att ttt aag aaa att 96
Met Ser Ala Val Ser Ala Ala Pro Lys Trp Lys Ile Phe Lys Lys Ile
20 25 30

gaa aaa gtc gga agg aac gtc cgt gat ggt att atc aaa gcg gga cca 144
Glu Lys Val Gly Arg Asn Val Arg Asp Gly Ile Ile Lys Ala Gly Pro
35 40 45

gcg ata caa gtg ctg gga cag gcg aaa gcg att gga aaa tga 186
Ala Ile Gln Val Leu Gly Gln Ala Lys Ala Ile Gly Lys *
50 55 60



agctgtattg cagtgttctt aaagtcttta ttacctcaac aaaatgccat aactgtatac 246 
tcttatagat aagtgaatca gaagaatgat ctgatgtaga gataatgaat ctgcctgtat 306 
ttctttgaat aaattaagtg aatgtaaata tttttttaaa taaataattt ttattaatct 366 
t 367 

<210> 67 
<211> 61 
<212> PRT 

<213> Manduca sexta 



<400> 67 

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Leu Val Leu Gly
1 5 10 15

Met Ser Ala Val Ser Ala Ala Pro Lys Trp Lys Ile Phe Lys Lys Ile
20 25 30

Glu Lys Val Gly Arg Asn Val Arg Asp Gly Ile Ile Lys Ala Gly Pro
35 40 45

Ala Ile Gln Val Leu Gly Gln Ala Lys Ala Ile Gly Lys
50 55 60



<210> 68 
<211> 61 
<212> PRT 

<213> Manduca sexta 
<400> 68 

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Leu Val Leu Gly
1 5 10 15

Met Ser Ala Val Ser Ala Ala Pro Lys Trp Lys Ile Phe Lys Lys Ile
20 25 30

Glu Lys Val Gly Arg Asn Val Arg Asp Gly Ile Ile Lys Ala Gly Pro
35 40 45

Ala Ile Gln Val Leu Gly Gln Ala Lys Ala Ile Gly Lys
50 55 60



<210> 69 
<211> 230 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (135) 

<400> 69 

atg gct tca gct gca cct tgg aat ccc ttc aag gag ctg gag aga gct 48
Met Ala Ser Ala Ala Pro Trp Asn Pro Phe Lys Glu Leu Glu Arg Ala
1 5 10 15

ggt cag cga gtc cgc gac gcc atc atc agc gca ggc cca gca gtc gcg 96
Gly Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Ala
20 25 30

acc gtc gga cag gcg gcc gct atc gcc agg ggt ggt taa gcaacgaatg 145
Thr Val Gly Gln Ala Ala Ala Ile Ala Arg Gly Gly *
35 40

ctttatctat gaatatgctt attaattata taagtttcat gtatctttat tacaataatg 205
atttggtata ataaacgtca ataat 230

<210> 70
<211> 44
<212> PRT

<213> Manduca sexta
<400> 70

Met Ala Ser Ala Ala Pro Trp Asn Pro Phe Lys Glu Leu Glu Arg Ala
1 5 10 15

Gly Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Ala
20 25 30

Thr Val Gly Gln Ala Ala Ala Ile Ala Arg Gly Gly
35 40



<210> 71 
<211> 44 
<212> PRT 

<213> Manduca sexta 



<400> 71 

Met Ala Ser Ala Ala Pro Trp Asn Pro Phe Lys Glu Leu Glu Arg Ala
1 5 10 15

Gly Gln Arg Val Arg Asp Ala Ile Ile Ser Ala Gly Pro Ala Val Ala
20 25 30

Thr Val Gly Gln Ala Ala Ala Ile Ala Arg Gly Gly
35 40



<210> 72 
<211> 287 
<212> DNA 

<213> Manduca sexta 

<220> 
<221> CDS 

<222> (25) . . . (287) 



<400> 72 

actagtggat cccccgggct gcag ggt gaa aca atc atg aaa ttg cta ctg 51
Gly Glu Thr Ile Met Lys Leu Leu Leu
1 5

att ttg ggc gtt gcg ctg gtg ttg ctc ttt ggt gag tcc tta ggt cag 99
Ile Leu Gly Val Ala Leu Val Leu Leu Phe Gly Glu Ser Leu Gly Gln
10 15 20 25



cga ttt agc cag cct acg ttc aag cta cct caa ggt aga ttg aca ctt 147
Arg Phe Ser Gln Pro Thr Phe Lys Leu Pro Gln Gly Arg Leu Thr Leu
30 35 40

agt cga aaa ttt agg gag tcc ggc aat gag cca cta tgg ttg tat caa 195
Ser Arg Lys Phe Arg Glu Ser Gly Asn Glu Pro Leu Trp Leu Tyr Gln
45 50 55

ggc gac aac ata cca aag gca cca tca act gca gaa cat ccc ttc ctt 243
Gly Asp Asn Ile Pro Lys Ala Pro Ser Thr Ala Glu His Pro Phe Leu
60 65 70

ccg tct ata ata gat gat gtg aag ttc aat cca gat aga aga ta 287
Pro Ser Ile Ile Asp Asp Val Lys Phe Asn Pro Asp Arg Arg
75 80 85



<210> 73
<211> 87
<212> PRT

<213> Manduca sexta
<400> 73

Gly Glu Thr Ile Met Lys Leu Leu Leu Ile Leu Gly Val Ala Leu Val
1 5 10 15

Leu Leu Phe Gly Glu Ser Leu Gly Gln Arg Phe Ser Gln Pro Thr Phe
20 25 30

Lys Leu Pro Gln Gly Arg Leu Thr Leu Ser Arg Lys Phe Arg Glu Ser
35 40 45

Gly Asn Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asn Ile Pro Lys Ala
50 55 60

Pro Ser Thr Ala Glu His Pro Phe Leu Pro Ser Ile Ile Asp Asp Val
65 70 75 80

Lys Phe Asn Pro Asp Arg Arg
85


<210> 74 
<211> 87 
<212> PRT 

<213> Manduca sexta 
<400> 74

Gly Glu Thr Ile Met Lys Leu Leu Leu Ile Leu Gly Val Ala Leu Val
1 5 10 15

Leu Leu Phe Gly Glu Ser Leu Gly Gln Arg Phe Ser Gln Pro Thr Phe
20 25 30

Lys Leu Pro Gln Gly Arg Leu Thr Leu Ser Arg Lys Phe Arg Glu Ser
35 40 45

Gly Asn Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asn Ile Pro Lys Ala
50 55 60

Pro Ser Thr Ala Glu His Pro Phe Leu Pro Ser Ile Ile Asp Asp Val
65 70 75 80

Lys Phe Asn Pro Asp Arg Arg
85


<210> 75 
<211> 220 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (192) 

<400> 75 

atg aac ttc tcc cgc att ttc ttc ttt gtg ttc gct ctg gtc ctc agt 48
Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Leu Val Leu Ser
1 5 10 15

ctg tcg gcg gtg tcc gcg gct cct gaa ccg aaa tgg aag gtg ttt aag 96
Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Lys Trp Lys Val Phe Lys
20 25 30

aaa att gaa aaa atg ggc cga aat atc aga gat gga att atc aaa gct 144
Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Ile Lys Ala
35 40 45

ggc cca gcg att gaa gtc ctt ggc gca gct aag gcc ata gga aag tga 192
Gly Pro Ala Ile Glu Val Leu Gly Ala Ala Lys Ala Ile Gly Lys *
50 55 60

acctaatgct tccttgttag tctatttt 220

<210> 76
<211> 63
<212> PRT

<213> Manduca sexta
<400> 76

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Leu Val Leu Ser
1 5 10 15

Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Lys Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Ile Lys Ala
35 40 45

Gly Pro Ala Ile Glu Val Leu Gly Ala Ala Lys Ala Ile Gly Lys
50 55 60

<210> 77 
<211> 63 
<212> PRT 

<213> Manduca sexta 
<400> 77 

Met Asn Phe Ser Arg Ile Phe Phe Phe Val Phe Ala Leu Val Leu Ser
1 5 10 15

Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Lys Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Ile Lys Ala
35 40 45

Gly Pro Ala Ile Glu Val Leu Gly Ala Ala Lys Ala Ile Gly Lys
50 55 60

<210> 78 

<211> 293 

<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (279) 

<400> 78 

atg aat tta tta tat ttc ctt tcg ttt ctg ggc tgt att act ctc tgc 48
Met Asn Leu Leu Tyr Phe Leu Ser Phe Leu Gly Cys Ile Thr Leu Cys
1 5 10 15

ttg agt gcc ggt ttg tac aaa cct cct aat aac ata gaa tct gag aac 96
Leu Ser Ala Gly Leu Tyr Lys Pro Pro Asn Asn Ile Glu Ser Glu Asn
20 25 30

nf* 2 fc ? 1*° 9 ? a aat atC t9C ttc tfc 3 cca fct 9 9aa gtt ggg gta 144 

Glu Val Tyr Thr Gly Asn Ile Cys Phe Leu Pro Leu Glu Val Gly Val
35 40 45

tgc cga gct ctg ttc ttt agg tac gga tac gat cca gcg ata aag gca 192
Cys Arg Ala Leu Phe Phe Arg Tyr Gly Tyr Asp Pro Ala Ile Lys Ala
50 55 60

tgc aag gaa ttc atg tac ggc ggt tgc caa ggg aac gct aac aat ttc 240
Cys Lys Glu Phe Met Tyr Gly Gly Cys Gln Gly Asn Ala Asn Asn Phe
65 70 75 80



aag act tta gaa gaa tgc cag gaa gcc tgt gaa gcc taa gtacctggac 289
Lys Thr Leu Glu Glu Cys Gln Glu Ala Cys Glu Ala *
85 90

ttcg 293

<210> 79
<211> 92
<212> PRT

<213> Manduca sexta

<400> 79

Met Asn Leu Leu Tyr Phe Leu Ser Phe Leu Gly Cys Ile Thr Leu Cys
1 5 10 15

Leu Ser Ala Gly Leu Tyr Lys Pro Pro Asn Asn Ile Glu Ser Glu Asn
20 25 30

Glu Val Tyr Thr Gly Asn Ile Cys Phe Leu Pro Leu Glu Val Gly Val
35 40 45

Cys Arg Ala Leu Phe Phe Arg Tyr Gly Tyr Asp Pro Ala Ile Lys Ala
50 55 60

Cys Lys Glu Phe Met Tyr Gly Gly Cys Gln Gly Asn Ala Asn Asn Phe
65 70 75 80

Lys Thr Leu Glu Glu Cys Gln Glu Ala Cys Glu Ala
85 90



<210> 80 
<211> 92 
<212> PRT 

<213> Manduca sexta 
<400> 80 

Met Asn Leu Leu Tyr Phe Leu Ser Phe Leu Gly Cys Ile Thr Leu Cys
1 5 10 15

Leu Ser Ala Gly Leu Tyr Lys Pro Pro Asn Asn Ile Glu Ser Glu Asn
20 25 30

Glu Val Tyr Thr Gly Asn Ile Cys Phe Leu Pro Leu Glu Val Gly Val
35 40 45

Cys Arg Ala Leu Phe Phe Arg Tyr Gly Tyr Asp Pro Ala Ile Lys Ala
50 55 60

Cys Lys Glu Phe Met Tyr Gly Gly Cys Gln Gly Asn Ala Asn Asn Phe
65 70 75 80

Lys Thr Leu Glu Glu Cys Gln Glu Ala Cys Glu Ala
85 90



<210> 81 
<211> 489 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (489) 

<400> 81 

atg aaa ttg cta ctg att ttg ggc gtt gcg ctg gtg ttg ctc ttt ggt 48
Met Lys Leu Leu Leu Ile Leu Gly Val Ala Leu Val Leu Leu Phe Gly
1 5 10 15

gag tcc tta ggt cag cga ttt agc cag cct acg ttc aag cta cct caa 96
Glu Ser Leu Gly Gln Arg Phe Ser Gln Pro Thr Phe Lys Leu Pro Gln
20 25 30

ggt aga ttg aca ctt agt cga aaa ttt agg gag tcc ggc aat gag cca 144
Gly Arg Leu Thr Leu Ser Arg Lys Phe Arg Glu Ser Gly Asn Glu Pro
35 40 45

cta tgg ttg tat caa ggc gac aac ata cca aag gca cca tca act gca 192
Leu Trp Leu Tyr Gln Gly Asp Asn Ile Pro Lys Ala Pro Ser Thr Ala
50 55 60

gaa cat ccc ttc ctt ccg tct ata ata gat gat gtg aag ttc aat cca 240
Glu His Pro Phe Leu Pro Ser Ile Ile Asp Asp Val Lys Phe Asn Pro
65 70 75 80

gat aga aga tac gcg cgc agt ctt ggt aca cca gac cat tat cat gga 288
Asp Arg Arg Tyr Ala Arg Ser Leu Gly Thr Pro Asp His Tyr His Gly
85 90 95

ggc cgt cat tcc ata tct cga ggt agc cag agc aca gga ccg act cat 336
Gly Arg His Ser Ile Ser Arg Gly Ser Gln Ser Thr Gly Pro Thr His
100 105 110

ccg ggc tat aat cgc cgt aac gcc agg agt gtc gaa acg tta gct agc 384
Pro Gly Tyr Asn Arg Arg Asn Ala Arg Ser Val Glu Thr Leu Ala Ser
115 120 125

caa gaa cat cta agc agc ctg ccg atg gat agc caa gag act tta ctg 432
Gln Glu His Leu Ser Ser Leu Pro Met Asp Ser Gln Glu Thr Leu Leu
130 135 140

cgt ggc acc agg agc gtg gaa aca cta gct agt cag gaa cat cta agc 480
Arg Gly Thr Arg Ser Val Glu Thr Leu Ala Ser Gln Glu His Leu Ser
145 150 155 160

agc ctg ccg 489
Ser Leu Pro



<210> 82 
<211> 163 
<212> PRT 

<213> Manduca sexta 
<400> 82 

Met Lys Leu Leu Leu Ile Leu Gly Val Ala Leu Val Leu Leu Phe Gly
1 5 10 15

Glu Ser Leu Gly Gln Arg Phe Ser Gln Pro Thr Phe Lys Leu Pro Gln
20 25 30

Gly Arg Leu Thr Leu Ser Arg Lys Phe Arg Glu Ser Gly Asn Glu Pro
35 40 45

Leu Trp Leu Tyr Gln Gly Asp Asn Ile Pro Lys Ala Pro Ser Thr Ala
50 55 60

Glu His Pro Phe Leu Pro Ser Ile Ile Asp Asp Val Lys Phe Asn Pro
65 70 75 80

Asp Arg Arg Tyr Ala Arg Ser Leu Gly Thr Pro Asp His Tyr His Gly
85 90 95

Gly Arg His Ser Ile Ser Arg Gly Ser Gln Ser Thr Gly Pro Thr His
100 105 110

Pro Gly Tyr Asn Arg Arg Asn Ala Arg Ser Val Glu Thr Leu Ala Ser
115 120 125

Gln Glu His Leu Ser Ser Leu Pro Met Asp Ser Gln Glu Thr Leu Leu
130 135 140

Arg Gly Thr Arg Ser Val Glu Thr Leu Ala Ser Gln Glu His Leu Ser
145 150 155 160

Ser Leu Pro



<210> 83
<211> 165
<212> PRT

<213> Manduca sexta
<400> 83

Met Lys Leu Leu Leu Ile Leu Gly Val Ala Leu Val Leu Leu Phe Gly
1 5 10 15

Glu Ser Leu Gly Gln Arg Phe Ser Gln Pro Thr Phe Lys Leu Pro Gln
20 25 30

Gly Arg Leu Thr Leu Ser Arg Lys Phe Arg Glu Ser Gly Asn Glu Pro
35 40 45

Leu Trp Leu Tyr Gln Gly Asp Asn Ile Pro Lys Ala Pro Ser Thr Ala
50 55 60

Glu His Pro Phe Leu Pro Ser Ile Ile Asp Asp Val Lys Phe Asn Pro
65 70 75 80

Asp Arg Arg Tyr Ala Arg Ser Leu Gly Thr Pro Asp His Tyr His Gly
85 90 95

Gly Arg His Ser Ile Ser Arg Gly Ser Gln Ser Thr Gly Pro Thr His
100 105 110

Pro Gly Tyr Asn Arg Arg Asn Ala Arg Ser Val Glu Thr Leu Ala Ser
115 120 125

Gln Glu His Leu Ser Ser Leu Pro Met Asp Ser Gln Glu Thr Leu Leu
130 135 140

Arg Gly Thr Arg Ser Val Glu Thr Leu Ala Ser Gln Glu His Leu Ser
145 150 155 160

Ser Leu Pro Met Asp
165


<210> 84 
<211> 475 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS

<222> (2) . . . (475)

<221> misc_feature
<222> 12, 13, 14
<223> n = A,T,C or G

<400> 84

g ccg ctc tag ann ngt gga tcc ccc ggg ctg cag gca aaa tcc aat ttc 49
Pro Leu * Xaa Xaa Gly Ser Pro Gly Leu Gln Ala Lys Ser Asn Phe
1 5 10 15

gcg ctt gga gtt atc gca att ctg tta ata aca gaa tcc aac tgt tgg 97
Ala Leu Gly Val Ile Ala Ile Leu Leu Ile Thr Glu Ser Asn Cys Trp
20 25 30

aga agt gat ctc cct atc ata ctc ccg act tat aaa cct cct cgt acc 145
Arg Ser Asp Leu Pro Ile Ile Leu Pro Thr Tyr Lys Pro Pro Arg Thr
35 40 45

ccg agc acc att att atc agg aca gta cgc gaa gcc gga gat aaa ccg 193
Pro Ser Thr Ile Ile Ile Arg Thr Val Arg Glu Ala Gly Asp Lys Pro
50 55 60

tta tgg ctc tac caa gga gac gat cac ccg caa gcc cct tca agc ggc 241
Leu Trp Leu Tyr Gln Gly Asp Asp His Pro Gln Ala Pro Ser Ser Gly
65 70 75

gat cat cct gta ctg ccc tcg att ata gac gat gtg caa ctg gat ccc 289
Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp Val Gln Leu Asp Pro
80 85 90 95

aac aga cgg tac gcg cgt agt gtg agc gag ccg tcg tct cag gat cat 337
Asn Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro Ser Ser Gln Asp His
100 105 110

cac gaa cgc ttt gtg agg agc ttc gac tcc cgc agc agc aag cat cac 385
His Glu Arg Phe Val Arg Ser Phe Asp Ser Arg Ser Ser Lys His His
115 120 125

ggc ggc agt cac tcc acg tcc agc ggc agc cgc gac act gga gct act 433
Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg Asp Thr Gly Ala Thr
130 135 140

cat ccg gga tac aat cgc cgt aac tca taa tct gtg gtt taa 475
His Pro Gly Tyr Asn Arg Arg Asn Ser * Ser Val Val *
145 150 155



<210> 85 
<211> 141 
<212> PRT 

<213> Manduca sexta 
<400> 85 

Lys Ser Asn Phe Ala Leu Gly Val Ile Ala Ile Leu Leu Ile Thr Glu
1 5 10 15

Ser Asn Cys Trp Arg Ser Asp Leu Pro Ile Ile Leu Pro Thr Tyr Lys
20 25 30

Pro Pro Arg Thr Pro Ser Thr Ile Ile Ile Arg Thr Val Arg Glu Ala
35 40 45

Gly Asp Lys Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro Gln Ala
50 55 60

Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp Asp Val
65 70 75 80

Gln Leu Asp Pro Asn Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro Ser
85 90 95

Ser Gln Asp His His Glu Arg Phe Val Arg Ser Phe Asp Ser Arg Ser
100 105 110

Ser Lys His His Gly Gly Ser His Ser Thr Ser Ser Gly Ser Arg Asp
115 120 125

Thr Gly Ala Thr His Pro Gly Tyr Asn Arg Arg Asn Ser
130 135 140

<210> 86 
<211> 155 
<212> PRT 

<213> Manduca sexta 
<220> 

<221> VARIANT 
<222> 3, 4 

<223> Xaa = Any Amino Acid 
<400> 86 

Pro Leu Xaa Xaa Gly Ser Pro Gly Leu Gln Ala Lys Ser Asn Phe Ala
1 5 10 15

Leu Gly Val Ile Ala Ile Leu Leu Ile Thr Glu Ser Asn Cys Trp Arg
20 25 30

Ser Asp Leu Pro Ile Ile Leu Pro Thr Tyr Lys Pro Pro Arg Thr Pro
35 40 45

Ser Thr Ile Ile Ile Arg Thr Val Arg Glu Ala Gly Asp Lys Pro Leu
50 55 60

Trp Leu Tyr Gln Gly Asp Asp His Pro Gln Ala Pro Ser Ser Gly Asp
65 70 75 80

His Pro Val Leu Pro Ser Ile Ile Asp Asp Val Gln Leu Asp Pro Asn
85 90 95

Arg Arg Tyr Ala Arg Ser Val Ser Glu Pro Ser Ser Gln Asp His His
100 105 110

Glu Arg Phe Val Arg Ser Phe Asp Ser Arg Ser Ser Lys His His Gly
115 120 125

Gly Ser His Ser Thr Ser Ser Gly Ser Arg Asp Thr Gly Ala Thr His
130 135 140

Pro Gly Tyr Asn Arg Arg Asn Ser Ser Val Val
145 150 155



<210> 87 
<211> 273 
<212> DNA 

<213> Manduca sexta 

<220> 

<221> CDS 

<222> (1) . . . (204) 

<400> 87 

atg aaa ttc tcc cgt gtt tta ttc ttc gtc ttc gct tgc ttc gcc gca 48
Met Lys Phe Ser Arg Val Leu Phe Phe Val Phe Ala Cys Phe Ala Ala
1 5 10 15

ttt aca gta act gcg gcc aag cca tgg gac ttc tta aag gag ctg gag 96
Phe Thr Val Thr Ala Ala Lys Pro Trp Asp Phe Leu Lys Glu Leu Glu
20 25 30

ggt gca ggt caa agg att cgt gac gct atc atc agc gcg cag ccg gcg 144
Gly Ala Gly Gln Arg Ile Arg Asp Ala Ile Ile Ser Ala Gln Pro Ala
35 40 45

gtg gaa acc atc gcg cag gca acc gcc att ttc aaa gga caa tca aaa 192
Val Glu Thr Ile Ala Gln Ala Thr Ala Ile Phe Lys Gly Gln Ser Lys
50 55 60

gaa gaa gat taa ttgtgtcatt acagtattac atatttaagg atataatttt 244
Glu Glu Asp *
65

attttgacaa tatattcatt taattcaac 273

<210> 88 
<211> 67 
<212> PRT 

<213> Manduca sexta 
<400> 88 

Met Lys Phe Ser Arg Val Leu Phe Phe Val Phe Ala Cys Phe Ala Ala
1 5 10 15

Phe Thr Val Thr Ala Ala Lys Pro Trp Asp Phe Leu Lys Glu Leu Glu
20 25 30

Gly Ala Gly Gln Arg Ile Arg Asp Ala Ile Ile Ser Ala Gln Pro Ala
35 40 45

Val Glu Thr Ile Ala Gln Ala Thr Ala Ile Phe Lys Gly Gln Ser Lys
50 55 60

Glu Glu Asp
65



<210> 89 
<211> 60 
<212> PRT 

<213> Manduca sexta 



<400> 89 

Met Lys Phe Ser Arg Val Leu Phe Phe Val Phe Ala Cys Phe Ala Ala
1 5 10 15

Phe Thr Val Thr Ala Ala Lys Pro Trp Asp Phe Leu Lys Glu Leu Glu
20 25 30

Gly Ala Gly Gln Arg Ile Arg Asp Ala Ile Ile Ser Ala Gln Pro Ala
35 40 45

Val Glu Thr Ile Ala Gln Ala Thr Ala Ile Phe Lys
50 55 60



<210> 90 
<211> 418 
<212> DNA 

<213> Peregrinus maidis 
<220> 

<221> CDS 

<222> (1) . . . (192) 

<221> misc_feature

<222> 259, 305, 330, 340, 358, 359, 372, 380, 397, 417
<223> n = A,T,C or G

<400> 90

atg aag ttc tcc cga gtg ttc ctg ttc gtg ttc gcg tgc ctg gtc gcg 48
Met Lys Phe Ser Arg Val Phe Leu Phe Val Phe Ala Cys Leu Val Ala
1 5 10 15

ctg agc gcc gtc agc gcc gcg cca gag ccg agg tgg aag gtc ttc aag 96
Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

aag att gag aag atg ggc cgc aac atc aga gac ggt atc gtc aag gca 144
Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

ggt cct gct gtc gag gtg ttg ggt gca gcc aaa gcg ctg ggg aag taa 192
Gly Pro Ala Val Glu Val Leu Gly Ala Ala Lys Ala Leu Gly Lys *
50 55 60

tcagcagtat catcttcatc atcatcactt aatatcatca caagtcttat ggtgtgacca 252
gcatatnctg gtgaccaaca acccctttaa attcctaaac ccaccaaaaa ggncgggtaa 312
cgcacttgtt acgcctcngg tgttttgnaa tgtccaaggg ggtggnnggc gattgcttan 372
ccatcaanaa tgattccttc tgatncgttt aaccggtaat ttccna 418

<210> 91 
<211> 63 
<212> PRT 

<213> Peregrinus maidis 
<400> 91 



20 

He Glu Lys 
35 

Pro Ala Va3 
50 



Arg Val 


Phe 


Leu Phe Val 


Phe Ala Cys 


Leu Val 


Ala 


5 




10 


15 




Ser Ala 


Ala 


Pro Glu Pro 


Arg Trp Lys 


Val Phe 


Lys 






25 




30 




Met Gly 


Arg 


Asn He Arg 


Asp Gly He 


Val Lys 


Ala 






40 


45 




Glu Val 


Leu 


Gly Ala Ala 


Lys Ala Leu Gly Lys 






55 




60 







<210> 92
<211> 63
<212> PRT

<213> Peregrinus maidis
<400> 92

Met Lys Phe Ser Arg Val Phe Leu Phe Val Phe Ala Cys Leu Val Ala
1 5 10 15

Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

Gly Pro Ala Val Glu Val Leu Gly Ala Ala Lys Ala Leu Gly Lys
50 55 60


<210> 93 
<211> 370 
<212> DNA 

<213> Peregrinus maidis 

<220> 

<221> CDS 

<222> (1) . . . (225) 



<400> 93 

atg aag ttc tcc cga gtg ttc ctg ttc gtg ttc gcg tgc ctg gtc gcg 48
Met Lys Phe Ser Arg Val Phe Leu Phe Val Phe Ala Cys Leu Val Ala
1 5 10 15

ctg agc gcc gtc agc gcc gcg cca gag ccg agg tgg aag gtc ttc aag 96
Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

aag att gag aag atg ggc cgc aac atc aga gac ggt atc gtc aag gca 144
Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

ggt cct gct gtc gag gtg ttg ggt gca agc caa ggc gct ggg gaa gta 192
Gly Pro Ala Val Glu Val Leu Gly Ala Ser Gln Gly Ala Gly Glu Val
50 55 60

atc agc agt atc atc ttc atc atc atc act taa tatcatcaca gtcttatggt 245
Ile Ser Ser Ile Ile Phe Ile Ile Ile Thr *
65 70

gtgaccagca tatctggtga caacaaccct taaattccta acccaccaaa agggcggtaa 305
cgcacttgtt acgcctcggg tgtttgaaat gtccaagggg tgggcggcga ttgcttacca 365
acaag 370

<210> 94 
<211> 74 
<212> PRT 

<213> Peregrinus maidis 
<400> 94 

Met Lys Phe Ser Arg Val Phe Leu Phe Val Phe Ala Cys Leu Val Ala
1 5 10 15

Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

Gly Pro Ala Val Glu Val Leu Gly Ala Ser Gln Gly Ala Gly Glu Val
50 55 60

Ile Ser Ser Ile Ile Phe Ile Ile Ile Thr
65 70



<210> 95 
<211> 63 
<212> PRT 
<213> Peregrinus maidis
<400> 95 

Met Lys Phe Ser Arg Val Phe Leu Phe Val Phe Ala Cys Leu Val Ala
1 5 10 15

Leu Ser Ala Val Ser Ala Ala Pro Glu Pro Arg Trp Lys Val Phe Lys
20 25 30

Lys Ile Glu Lys Met Gly Arg Asn Ile Arg Asp Gly Ile Val Lys Ala
35 40 45

Gly Pro Ala Val Glu Val Leu Gly Ala Ser Gln Gly Ala Gly Glu
50 55 60



<210> 96 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide sequence from Lys-C digested Mag1
<400> 96 

Val Gly Ala Ser Leu Gly Ala Ala His Thr Asp Phe 
1 5 10



<210> 97 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide sequence from Lys-C digested Mag1
<400> 97 

Asn Asn Ile Phe Ser Ala Ile Gly Gly Ala Asp Phe Asn Ala Asn His
1 5 10 15

Lys 



<210> 98 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide sequence from Lys-C digested Mag1
<400> 98 

Lys Phe Asp Thr Pro Phe Met Arg Ser Gly Trp Glu 
1 5 10 



<210> 99 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide sequence from Lys-C digested Mag1
<400> 99 

Leu Asn Leu Phe His Asn Asn Asn His Asp Leu Thr 
1 5 10

<210> 100
<211> 358
<212> DNA
<213> Agrotis ipsilon
<220>
<221> CDS
<222> (1)...(195)
<221> misc_feature
<222> (0)...(0)
<223> Fus6

<400> 100 

atg gcc gcc aac aag act atc ttc ctt ctc gtg ctg atc gcc ttc gca 48
Met Ala Ala Asn Lys Thr Ile Phe Leu Leu Val Leu Ile Ala Phe Ala
1 5 10 15

atg gtg atg gtg acc gtg gag gcc gtc cgt gtg gga ccc tgc gac cag 96
Met Val Met Val Thr Val Glu Ala Val Arg Val Gly Pro Cys Asp Gln
20 25 30

gtc tgc agc cgc atc gat gct gag aag aac gag tgc tgc aga gct cac 144
Val Cys Ser Arg Ile Asp Ala Glu Lys Asn Glu Cys Cys Arg Ala His
35 40 45

ggc tac tcc gga tac agc agc tgt aga tat ggg cag atg caa tgt tac 192
Gly Tyr Ser Gly Tyr Ser Ser Cys Arg Tyr Gly Gln Met Gln Cys Tyr
50 55 60

tga cggaactcca caagagcaac agttttctaa ccactttttc aactttgtcc 245
agaggtaatc aagattgcct catcacttca aaggttcttt tttgtcattt attaacttgt 305
tttcaaaatt aaccgattaa attaattaat ttaaaaaaaa aaaaaaaaaa aaa 358

<210> 101 
<211> 64 
<212> PRT 

<213> Agrotis ipsilon 
<400> 101 

Met Ala Ala Asn Lys Thr Ile Phe Leu Leu Val Leu Ile Ala Phe Ala
1 5 10 15

Met Val Met Val Thr Val Glu Ala Val Arg Val Gly Pro Cys Asp Gln
20 25 30

Val Cys Ser Arg Ile Asp Ala Glu Lys Asn Glu Cys Cys Arg Ala His
35 40 45

Gly Tyr Ser Gly Tyr Ser Ser Cys Arg Tyr Gly Gln Met Gln Cys Tyr
50 55 60



<210> 102 
<211> 123 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1)...(123)
<221> misc_feature
<222> (0)...(0)
<223> Fus6

<400> 102

gtc cgt gtg gga ccc tgc gac cag gtc tgc agc cgc atc gat gct gag 48
Val Arg Val Gly Pro Cys Asp Gln Val Cys Ser Arg Ile Asp Ala Glu
1 5 10 15

aag aac gag tgc tgc aga gct cac ggc tac tcc gga tac agc agc tgt 96
Lys Asn Glu Cys Cys Arg Ala His Gly Tyr Ser Gly Tyr Ser Ser Cys
20 25 30

aga tat ggg cag atg caa tgt tac tga 123
Arg Tyr Gly Gln Met Gln Cys Tyr *
35 40

<210> 103 
<211> 40 
<212> PRT 

<213> Agrotis ipsilon 
<400> 103 

Val Arg Val Gly Pro Cys Asp Gln Val Cys Ser Arg Ile Asp Ala Glu
1 5 10 15

Lys Asn Glu Cys Cys Arg Ala His Gly Tyr Ser Gly Tyr Ser Ser Cys
20 25 30

Arg Tyr Gly Gln Met Gln Cys Tyr
35 40

<210> 104
<211> 387 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1) . . . (195) 

<221> misc_feature
<222> (0) . . . (0) 
<223> Fus7 

<400> 104 

atg gtt gcc aac aag act atc ctc ctt ctc gtg ctg atc gcc ttc gca 48
Met Val Ala Asn Lys Thr Ile Leu Leu Leu Val Leu Ile Ala Phe Ala
1 5 10 15

atg gtg atg gtg acc gtg gaa gcc gtc cat gtg gga ccc tgc gac cag 96
Met Val Met Val Thr Val Glu Ala Val His Val Gly Pro Cys Asp Gln
20 25 30

gtc tgc agc cgc atc gac gct gag aag gac gag tgc tgc aga gct cac 144
Val Cys Ser Arg Ile Asp Ala Glu Lys Asp Glu Cys Cys Arg Ala His
35 40 45

ggc cac tcc ggc tac agc agc tgc aga tac gga cag atg caa tgt tac 192
Gly His Ser Gly Tyr Ser Ser Cys Arg Tyr Gly Gln Met Gln Cys Tyr
50 55 60

tga cggtactccg caacaacaac ggtactatag tggagctatt gtgtaacttt 245
tccaaataca tgtgaaagtt aactgtgata tttttaagtt cctttacttt tgaattcggc 305
atgtgattaa gttattgttt aataaaagga attatttatg aaaaaaaaaa aaaaaaaaaa 365
aaaaaaaaaa aaaaaaaaaa aa 387

<210> 105
<211> 64 
<212> PRT 

<213> Agrotis ipsilon 
<400> 105 

Met Val Ala Asn Lys Thr Ile Leu Leu Leu Val Leu Ile Ala Phe Ala
1 5 10 15

Met Val Met Val Thr Val Glu Ala Val His Val Gly Pro Cys Asp Gln
20 25 30

Val Cys Ser Arg Ile Asp Ala Glu Lys Asp Glu Cys Cys Arg Ala His
35 40 45

Gly His Ser Gly Tyr Ser Ser Cys Arg Tyr Gly Gln Met Gln Cys Tyr
50 55 60



<210> 106 
<211> 123 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1) . . . (123) 

<221> misc_feature 
<222> (0) . . . (0) 
<223> Fus7 

<400> 106 

gtc cat gtg gga ccc tgc gac cag gtc tgc agc cgc atc gac gct gag 48
Val His Val Gly Pro Cys Asp Gln Val Cys Ser Arg Ile Asp Ala Glu
1 5 10 15

aag gac gag tgc tgc aga gct cac ggc cac tcc ggc tac agc agc tgc 96
Lys Asp Glu Cys Cys Arg Ala His Gly His Ser Gly Tyr Ser Ser Cys
20 25 30

aga tac gga cag atg caa tgt tac tga 123
Arg Tyr Gly Gln Met Gln Cys Tyr *
35 40



<210> 107 
<211> 40 
<212> PRT 

<213> Agrotis ipsilon 
<400> 107 

Val His Val Gly Pro Cys Asp Gln Val Cys Ser Arg Ile Asp Ala Glu
1 5 10 15

Lys Asp Glu Cys Cys Arg Ala His Gly His Ser Gly Tyr Ser Ser Cys
20 25 30

Arg Tyr Gly Gln Met Gln Cys Tyr
35 40



<210> 108 
<211> 361 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1) . . . (195) 
<221> misc_feature
<222> (0)...(0)
<223> Fus8
<221> misc_feature
<222> 327, 328, 329, 330, 331, 332, 333
<223> n = A,T,C or G

<400> 108

atg gtt gcc aac aag acc atc ttc ctt ctc gtg ctg atc gcc ttc gca 48
Met Val Ala Asn Lys Thr Ile Phe Leu Leu Val Leu Ile Ala Phe Ala
1 5 10 15

atg gtg atg gtg acc gtg gag gcc gtc cgt gtg gga ccc tgc gac cag 96
Met Val Met Val Thr Val Glu Ala Val Arg Val Gly Pro Cys Asp Gln
20 25 30

gtc tgc agc cgc atc gac gct gag aag gac gag tgc tgc aga gct cac 144
Val Cys Ser Arg Ile Asp Ala Glu Lys Asp Glu Cys Cys Arg Ala His
35 40 45

ggc cac tcc ggc tac agc agc tgc aga tac gga cag atg caa tgt tac 192
Gly His Ser Gly Tyr Ser Ser Cys Arg Tyr Gly Gln Met Gln Cys Tyr
50 55 60

tga cggaactccg caacgacaac ggtactatag tggagctact gtgtaacttc 245
tctaaatttc tattactttc gaattcggca tgtgataaag ttattgttta ataaaaggaa 305
ttatttataa aaaaaaaaaa annnnnnnaa aaaaaaaaaa aaaaaaaaaa aaaaaa 361



<210> 109 
<211> 64 
<212> PRT 

<213> Agrotis ipsilon 



<400> 109 

Met Val Ala Asn Lys Thr Ile Phe Leu Leu Val Leu Ile Ala Phe Ala
1 5 10 15

Met Val Met Val Thr Val Glu Ala Val Arg Val Gly Pro Cys Asp Gln
20 25 30

Val Cys Ser Arg Ile Asp Ala Glu Lys Asp Glu Cys Cys Arg Ala His
35 40 45

Gly His Ser Gly Tyr Ser Ser Cys Arg Tyr Gly Gln Met Gln Cys Tyr
50 55 60



<210> 110 
<211> 123 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1) . . . (123) 

<221> misc_feature 
<222> (0) . . . (0) 
<223> Fus8 



<400> 110 



gtc cgt gtg gga ccc tgc gac cag gtc tgc agc cgc atc gac gct gag 48
Val Arg Val Gly Pro Cys Asp Gln Val Cys Ser Arg Ile Asp Ala Glu
1 5 10 15

aag gac gag tgc tgc aga gct cac ggc cac tcc ggc tac agc agc tgc 96
Lys Asp Glu Cys Cys Arg Ala His Gly His Ser Gly Tyr Ser Ser Cys
20 25 30

aga tac gga cag atg caa tgt tac tga 123
Arg Tyr Gly Gln Met Gln Cys Tyr *
35 40



<210> 111
<211> 40 
<212> PRT 

<213> Agrotis ipsilon 
<400> 111 

Val Arg Val Gly Pro Cys Asp Gln Val Cys Ser Arg Ile Asp Ala Glu
1 5 10 15

Lys Asp Glu Cys Cys Arg Ala His Gly His Ser Gly Tyr Ser Ser Cys
20 25 30

Arg Tyr Gly Gln Met Gln Cys Tyr
35 40



<210> 112
<211> 466
<212> DNA
<213> Agrotis ipsilon
<220>
<221> CDS
<222> (1)...(291)
<221> misc_feature
<222> (0)...(0)
<223> Fus9

<400> 112

atg aac aag caa ctg tta gtc gtc ctt ttg gcc atg tgc ctt gtc agc 48
Met Asn Lys Gln Leu Leu Val Val Leu Leu Ala Met Cys Leu Val Ser
1 5 10 15

gct cac gct ttc gtg aaa cgc gat gtc cca aca aat gca gac tta cag 96
Ala His Ala Phe Val Lys Arg Asp Val Pro Thr Asn Ala Asp Leu Gln
20 25 30

gga caa cta gaa gcc ttg aga aac acc ctt aat cag tta acc aac tca 144
Gly Gln Leu Glu Ala Leu Arg Asn Thr Leu Asn Gln Leu Thr Asn Ser
35 40 45

gtc att aat caa act tca act gtt ttc gac ccg gaa gaa att aag aag 192
Val Ile Asn Gln Thr Ser Thr Val Phe Asp Pro Glu Glu Ile Lys Lys
50 55 60

aat atc gat aaa gcc att gac aca gct agc aaa gcc att gat agt tta 240
Asn Ile Asp Lys Ala Ile Asp Thr Ala Ser Lys Ala Ile Asp Ser Leu
65 70 75 80

gtg aaa cca caa gga gga gaa gcc cag ccc gct gcc cag cca gca gcc 288
Val Lys Pro Gln Gly Gly Glu Ala Gln Pro Ala Ala Gln Pro Ala Ala
85 90 95

taa ttttatgttt aagactgatt tttatgacca cataaaatac ctcaaataaa 341
acatcaaaat taatctgctt cttcctatct ttcagaaaac taaattaaat aaataattta 401
tacgtctgct taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 461
aaaaa 466

<210> 113 
<211> 96 
<212> PRT 

<213> Agrotis ipsilon 



<400> 113

Met Asn Lys Gln Leu Leu Val Val Leu Leu Ala Met Cys Leu Val Ser
1 5 10 15

Ala His Ala Phe Val Lys Arg Asp Val Pro Thr Asn Ala Asp Leu Gln
20 25 30

Gly Gln Leu Glu Ala Leu Arg Asn Thr Leu Asn Gln Leu Thr Asn Ser
35 40 45

Val Ile Asn Gln Thr Ser Thr Val Phe Asp Pro Glu Glu Ile Lys Lys
50 55 60

Asn Ile Asp Lys Ala Ile Asp Thr Ala Ser Lys Ala Ile Asp Ser Leu
65 70 75 80

Val Lys Pro Gln Gly Gly Glu Ala Gln Pro Ala Ala Gln Pro Ala Ala
85 90 95



<210> 114 
<211> 222 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1) . . . (222) 

<221> misc_feature 
<222> (0) . . . (0) 
<223> Fus9 



<400> 114 

gat gtc cca aca aat gca gac tta cag gga caa cta gaa gcc ttg aga 48
Asp Val Pro Thr Asn Ala Asp Leu Gln Gly Gln Leu Glu Ala Leu Arg
1 5 10 15

aac acc ctt aat cag tta acc aac tca gtc att aat caa act tca act 96
Asn Thr Leu Asn Gln Leu Thr Asn Ser Val Ile Asn Gln Thr Ser Thr
20 25 30

gtt ttc gac ccg gaa gaa att aag aag aat atc gat aaa gcc att gac 144
Val Phe Asp Pro Glu Glu Ile Lys Lys Asn Ile Asp Lys Ala Ile Asp
35 40 45

aca gct agc aaa gcc att gat agt tta gtg aaa cca caa gga gga gaa 192
Thr Ala Ser Lys Ala Ile Asp Ser Leu Val Lys Pro Gln Gly Gly Glu
50 55 60

gcc cag ccc gct gcc cag cca gca gcc taa 222
Ala Gln Pro Ala Ala Gln Pro Ala Ala *
65 70

<210> 115 
<211> 73 
<212> PRT 

<213> Agrotis ipsilon 
<400> 115 

Asp Val Pro Thr Asn Ala Asp Leu Gln Gly Gln Leu Glu Ala Leu Arg
1 5 10 15

Asn Thr Leu Asn Gln Leu Thr Asn Ser Val Ile Asn Gln Thr Ser Thr
20 25 30

Val Phe Asp Pro Glu Glu Ile Lys Lys Asn Ile Asp Lys Ala Ile Asp
35 40 45

Thr Ala Ser Lys Ala Ile Asp Ser Leu Val Lys Pro Gln Gly Gly Glu
50 55 60

Ala Gln Pro Ala Ala Gln Pro Ala Ala
65 70



<210> 116
<211> 372
<212> DNA
<213> Agrotis ipsilon
<220>
<221> CDS
<222> (1)...(222)
<221> misc_feature
<222> (0)...(0)
<223> Fus10
<221> misc_feature
<222> 242
<223> n = A,T,C or G

<400> 116

atg tcg aaa agc tac cag ..c gtg ttg ttg ttg gtg tgc ctc acg ttc 48
Met Ser Lys Ser Tyr Gln Ser Val Leu Leu Leu Val Cys Leu Thr Phe
1 5 10 15

ctg gtg atc gtc tcg tct ccg cag aat gct gtc cag gct gat gta cac 96
Leu Val Ile Val Ser Ser Pro Gln Asn Ala Val Gln Ala Asp Val His
20 25 30

atc ggc agc tgc gtg tgg gga gct gtt gac tac act tcg aac tgc aac 144
Ile Gly Ser Cys Val Trp Gly Ala Val Asp Tyr Thr Ser Asn Cys Asn
35 40 45

aat gaa tgc aag cgg cgt gga tac aaa gga gga cat tgt gga agc ttc 192
Asn Glu Cys Lys Arg Arg Gly Tyr Lys Gly Gly His Cys Gly Ser Phe
50 55 60

gct aat gtt aat tgt tgg tgt gaa caa tag gacaacaatt taacattagn 242
Ala Asn Val Asn Cys Trp Cys Glu Gln *
65 70

acactaaaca aaccatcaaa atttgcagac gtggacacct ttcatagttt ttataccttg 302
tcactatggt ggatggacta tcaaaatggt tcatgatttt gaaatttgta tctttaatct 362
cggactgatg 372

<210> 117 
<211> 73 
<212> PRT 

<213> Agrotis ipsilon 
<400> 117 

Met Ser Lys Ser Tyr Gln Ser Val Leu Leu Leu Val Cys Leu Thr Phe
1 5 10 15

Leu Val Ile Val Ser Ser Pro Gln Asn Ala Val Gln Ala Asp Val His
20 25 30

Ile Gly Ser Cys Val Trp Gly Ala Val Asp Tyr Thr Ser Asn Cys Asn
35 40 45

Asn Glu Cys Lys Arg Arg Gly Tyr Lys Gly Gly His Cys Gly Ser Phe
50 55 60

Ala Asn Val Asn Cys Trp Cys Glu Gln
65 70



<210> 118 
<211> 135 
<212> DNA 

<213> Agrotis ipsilon 

<220> 

<221> CDS 

<222> (1) . . . (135) 

<221> misc_feature
<222> (0)...(0)
<223> Fus10

<400> 118 



gat gta cac atc ggc agc tgc gtg tgg gga gct gtt gac tac act tcg 48
Asp Val His Ile Gly Ser Cys Val Trp Gly Ala Val Asp Tyr Thr Ser
1 5 10 15

aac tgc aac aat gaa tgc aag cgg cgt gga tac aaa gga gga cat tgt 96
Asn Cys Asn Asn Glu Cys Lys Arg Arg Gly Tyr Lys Gly Gly His Cys
20 25 30

gga agc ttc gct aat gtt aat tgt tgg tgt gaa caa tag 135
Gly Ser Phe Ala Asn Val Asn Cys Trp Cys Glu Gln *
35 40

<210> 119 
<211> 44 
<212> PRT 

<213> Agrotis ipsilon 
<400> 119 

Asp Val His Ile Gly Ser Cys Val Trp Gly Ala Val Asp Tyr Thr Ser
1 5 10 15

Asn Cys Asn Asn Glu Cys Lys Arg Arg Gly Tyr Lys Gly Gly His Cys
20 25 30

Gly Ser Phe Ala Asn Val Asn Cys Trp Cys Glu Gln
35 40

<210> 120 
<211> 243 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Codon biased nucleotide sequence encoding
BAA-Fus1. Codon biased to Manduca sexta.
<221> CDS
<222> (1)...(243)
<221> sig_peptide
<222> (1)...(72)
<223> BAA signal sequence
<221> misc_feature
<222> (0)...(0)
<223> BAA-Fus1

<400> 120

atg gca aac aag cat ttg agc ctg agc ctc ttt ttg gtt ctg cta gga 48
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

ctc tca gcc tcg ctt gct agt ggt gaa gac ccc aga tgt tcc caa ccg 96
Leu Ser Ala Ser Leu Ala Ser Gly Glu Asp Pro Arg Cys Ser Gln Pro
-5 1 5

atc gct tcc ggc gtg tgc ttc ggc aac att gag aag ttc gga tat gat 144
Ile Ala Ser Gly Val Cys Phe Gly Asn Ile Glu Lys Phe Gly Tyr Asp
10 15 20

atc gac gag cac aaa tgc gtg cag ttt gta tac ggg ggc tgc ttc ggt 192
Ile Asp Glu His Lys Cys Val Gln Phe Val Tyr Gly Gly Cys Phe Gly
25 30 35 40

aat gat aac caa ttc gac tct ctg gag gaa tgc cag gcg gtc tgt cct 240
Asn Asp Asn Gln Phe Asp Ser Leu Glu Glu Cys Gln Ala Val Cys Pro
45 50 55

taa 243



<210> 121 
<211> 80 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SIGNAL 
<222> (1) ... (24) 



<223> Codon biased nucleotide sequence encoding
BAA-Fus1. Codon biased to Manduca sexta.

<400> 121

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

Leu Ser Ala Ser Leu Ala Ser Gly Glu Asp Pro Arg Cys Ser Gln Pro
-5 1 5

Ile Ala Ser Gly Val Cys Phe Gly Asn Ile Glu Lys Phe Gly Tyr Asp
10 15 20

Ile Asp Glu His Lys Cys Val Gln Phe Val Tyr Gly Gly Cys Phe Gly
25 30 35 40

Asn Asp Asn Gln Phe Asp Ser Leu Glu Glu Cys Gln Ala Val Cys Pro
45 50 55



<210> 122 
<211> 171 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Codon biased nucleotide sequence encoding
BAA-Fus1. Codon biased to Manduca sexta.
<221> misc_feature
<222> (0)...(0)
<223> Fus1
<221> CDS
<222> (1)...(171)

<400> 122

gaa gac ccc aga tgt tcc caa ccg atc gct tcc ggc gtg tgc ttc ggc 48
Glu Asp Pro Arg Cys Ser Gln Pro Ile Ala Ser Gly Val Cys Phe Gly
1 5 10 15

aac att gag aag ttc gga tat gat atc gac gag cac aaa tgc gtg cag 96
Asn Ile Glu Lys Phe Gly Tyr Asp Ile Asp Glu His Lys Cys Val Gln
20 25 30

ttt gta tac ggg ggc tgc ttc ggt aat gat aac caa ttc gac tct ctg 144
Phe Val Tyr Gly Gly Cys Phe Gly Asn Asp Asn Gln Phe Asp Ser Leu
35 40 45

gag gaa tgc cag gcg gtc tgt cct taa 171
Glu Glu Cys Gln Ala Val Cys Pro *
50 55



<210> 123
<211> 80
<212> PRT
<213> Artificial Sequence
<220>
<223> BAA-Fus1
<221> SIGNAL
<222> (1)...(24)
<223> BAA

<400> 123

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-20 -15 -10

Leu Ser Ala Ser Leu Ala Ser Gly Glu Asp Pro Arg Cys Ser Gln Pro
-5 1 5

Ile Ala Ser Gly Val Cys Phe Gly Asn Ile Glu Lys Phe Gly Tyr Asp
10 15 20

Ile Asp Glu His Lys Cys Val Gln Phe Val Tyr Gly Gly Cys Phe Gly
25 30 35 40

Asn Asp Asn Gln Phe Asp Ser Leu Glu Glu Cys Gln Ala Val Cys Pro
45 50 55



<210> 124 
<211> 207 
<212> DNA
<213> Artificial Sequence
<220>
<223> Codon biased nucleotide sequence encoding
BAA-Fus2. Codon biased to Streptomyces
coelicolor.
<221> CDS
<222> (1)...(207)
<221> sig_peptide
<222> (1)...(75)
<223> BAA signal sequence
<221> misc_feature
<222> (0)...(0)
<223> BAA-Fus2

<400> 124

atg gcg aac aag cac ctg tcc ctc tcc ctc ttc ctg gtc ctg ctg ggc 48
Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-25 -20 -15 -10

ctc tcg gcg acc ccg tcc gcc cag gcg gac gcc ggc gac gag ccg ctg 96
Leu Ser Ala Thr Pro Ser Ala Gln Ala Asp Ala Gly Asp Glu Pro Leu
-5 1 5

tgg ctg tac cag ggc gac gac cac ccc aga gcc ccg agc agc ggg gac 144
Trp Leu Tyr Gln Gly Asp Asp His Pro Arg Ala Pro Ser Ser Gly Asp
10 15 20

cac ccg gtg ctc ccc tcg atc atc gac gac gtc aag ctg gac ccc aac 192
His Pro Val Leu Pro Ser Ile Ile Asp Asp Val Lys Leu Asp Pro Asn
25 30 35

cgg cgc tac gcc tga 207
Arg Arg Tyr Ala *
40



<210> 125 
<211> 68 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SIGNAL 
<222> (1) . . . (25) 

<223> Codon biased nucleotide sequence encoding
BAA-Fus2. Codon biased to Streptomyces
coelicolor.

<400> 125

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-25 -20 -15 -10

Leu Ser Ala Thr Pro Ser Ala Gln Ala Asp Ala Gly Asp Glu Pro Leu
-5 1 5

Trp Leu Tyr Gln Gly Asp Asp His Pro Arg Ala Pro Ser Ser Gly Asp
10 15 20

His Pro Val Leu Pro Ser Ile Ile Asp Asp Val Lys Leu Asp Pro Asn
25 30 35

Arg Arg Tyr Ala
40



<210> 126 

<211> 132 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Codon biased nucleotide sequence encoding Fus2.
Codon biased to Streptomyces coelicolor.

<221> CDS 

<222> (1) . . . (132) 

<221> misc_feature 
<222> (0) . . . (0) 
<223> Fus2 

<400> 126 

gac gcc ggc gac gag ccg ctg tgg ctg tac cag ggc gac gac cac ccc 48
Asp Ala Gly Asp Glu Pro Leu Trp Leu Tyr Gln Gly Asp Asp His Pro
1 5 10 15

aga gcc ccg agc agc ggg gac cac ccg gtg ctc ccc tcg atc atc gac 96
Arg Ala Pro Ser Ser Gly Asp His Pro Val Leu Pro Ser Ile Ile Asp
20 25 30

gac gtc aag ctg gac ccc aac cgg cgc tac gcc tga 132
Asp Val Lys Leu Asp Pro Asn Arg Arg Tyr Ala *
35 40



<210> 127 
<211> 68 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Codon biased nucleotide sequence encoding 
BAA-Fus2. Codon biased to Streptomyces 
coelicolor. 



<221> SIGNAL 
<222> (1) . . . (25) 
<223> BAA 



<400> 127 

Met Ala Asn Lys His Leu Ser Leu Ser Leu Phe Leu Val Leu Leu Gly
-25 -20 -15 -10

Leu Ser Ala Thr Pro Ser Ala Gln Ala Asp Ala Gly Asp Glu Pro Leu
-5 1 5

Trp Leu Tyr Gln Gly Asp Asp His Pro Arg Ala Pro Ser Ser Gly Asp
10 15 20

His Pro Val Leu Pro Ser Ile Ile Asp Asp Val Lys Leu Asp Pro Asn
25 30 35

Arg Arg Tyr Ala
40

isolated from the hemolymph and fat bodies of insect larvae induced by injection of plant pathogenic fungi. Further provided are
plant cells, plants, and seed thereof, transformed with the nucleic acid molecules of the invention so as to confer disease resistance